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Abstract 

On One or more searchers must capture an invisible evader hiding in the nodes of a graph. We 

study this version of the graph search problem under additional restrictions, such as monotonicity 
and connectedness. We emphasize that we study node search, i.e., the capture of a node-located 
evader; this problem has so far received much less attention than edge search, i.e., the capture of an 
edge-located evader. 

We show that in general graphs the problem of node search is easier than that of edge search, 
Namely, every edge clearing search is also node clearing, but the converse does not hold in general 
(however node search is NP-complete, just like edge search). Then we concentrate on the internal 
CS| monotone connected (IMC) node search of trees and show that it is essentially equivalent to IMC 

edge search; hence Barriere's tree search algorithm [2 , originally designed for edge search, can also 

be used for node search. 

We return to IMC node search on general graphs and present (several variants of) a new algo- 
Q rithm: GSST (Guaranteed Search by Spanning Tree). GSST clears a graph G by performing all 

^ its clearing moves along a spanning tree T of G. Because spanning trees can be generated and 

O cleared very quickly, GSST can test a large number of spanning trees and find one which clears 

G with a small (though not necessarily minimal) number of searchers. We prove the existence of 
^-H probabilistically complete variants of GSST (i.e., these variants are guaranteed to find a minimal 

I> IMC node clearing schedule if run for sufficiently long time). Our experiments also indicate that 

CN GSST can efficiently node-clear large graphs given only a small running time. An implementation 

of GSST (running on Windows and Linux computers) is also provided and made publicly available. 

cn 

•£} 1 Introduction 
O 

In this paper we study a variant of graph search. Our motivation comes from applied robotics problems, 
sjjl As an introduction, consider the following two problems, which can both be modeled as graph searches. 

^ 1. An evader is hiding in a building and a team of one or more searchers are trying to capture him. 

^ The building is represented by a graph G: nodes are rooms and edges are doors between rooms. 

Both searchers and evader occupy the nodes of the graph and move from node to node by sliding 
along the edges. The evader is captured when he is located in the same node as a searcher. It 
is assumed that the evader has the following characteristics: (a) he wants to avoid capture, (b) 
he is invisible to the searchers (unless he is located in the same node as one of them), (c) he is 
always aware of the searchers' locations and (d) he is arbitrarily fast. 

2. An evader and a team of searchers are located inside a cave. The cave is again represented by 
a graph G: edges are tunnels and nodes are the intersections of tunnels. The searchers move 
from node to node, sliding along the edges; capture takes place if a searcher slides along an edge 
occupied by the evader, or if the evader moves through a node occupied by a searcher. The 
previously mentioned properties of the evader are assumed to hold in this case as well. 
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The two problems are very similar but differ in one important respect. In the first problem the 
evader is assumed to reside in the nodes of the graph (the rooms); in the second problem he is assumed 
to reside in the edges (the tunnels). Accordingly, we will call the first problem node search and the 
second edge search. Edge- or node-location of the evader is an important aspect of the graph search 
problem and can be used to categorize the related literature. Historically, research has concentrated 
on the problem of ''^edge-located evader". To the best of our knowledge, very little work has been done 
on the problem of ^^node-located evader". Interestingly, the term "node search" has been used in the 
past to denote a version of the graph search for edge-located evader [25]. Hence, to avoid confusion, 
let us repeat that we use "node search" to denote the search for a node-located evader. 

Another important aspect of the graph search problem is the "visibility" of the evader. Namely, 
we say that the evader is visible if the searchers always know his location; we say that the evader is 
invisible if the evaders may become aware of his location only when he is in the same node (or edge) 
as they. 

Additional aspects of the graph search include whether it is internal (the searchers can only move 
along the edges of the graph) or not (a searcher can, in one step, move to any node of the graph - we 
call this ^^teleporting"), monotone (the evader can never return to a searched part of the graph) and 
connected (the searched parts form a connected subgraph). These terms will be defined rigorously in 
Section |2j 

We now present a brief overview of the graph search literature. Early papers include [6] where the 
problem of cave searching was posed and pi] where the first mathematical study of the problem was 
presented. The problem was independently rediscovered a little later in [32]. The edge search problem 
was placed in the context of graph theory in p9j. The already mentioned "node search" variant 
(actually dealing with edge-located evader) appeared a little earlier in [25]. Another variant, called 
"mixed search" (and again dealing with edge-located evader) appeared in [5] and was further studied 
in [351 EE]- The study of connected edge search started relatively recently; see [21 El [HI UHl HH EH]- 
There is very little published on "true" node search, i.e., the search for a node-located evader. There 
is a considerable literature on pursuit of visible node-located evaders, which we will not discuss, since 
visible evader search is outside the scope of the current paper; let us simply mention that an important 
early paper is [30] . Graph search is also related to several graph parameters, most notably pathwidth 
and vertex separation. These connections are discussed in many papers, e.g., in [H [HI |9l [251 |25l |36] . 
Finally there are several reviews of the graph search literature: an old and deep one is [3], a more recent 
one is [1] and a very recent and very extensive one is [12] . 

The above mentioned papers adopt a theoretical point of view. While there is much discussion 
about graph search algorithms, we have found few actually implemented algorithms which can tackle 
relatively large graphs (with the exception of tree search algorithms). A much more applied line of 
research appears in the robotics literature, for example in [151 [TH [IT] [27] a-iid the book [28 . These papers 
present actual implementations of algorithms, as well as numerical and even physical experiments, 
but they provide little (if any) theoretical justification of their methods. In particular, the distinction 
between edge and node search often appears to be misunderstood. We attempt to clarify this distinction 
and also combine the robotics and graph theoretic points in two technical reports we have previously 
published [201 [23]. 

In the current paper we study graph search for an invisible, node-located evader. We also examine 
connections to the problem of invisible, edge-located evader. We do not deal with the case of visible 
evader. We are mainly interested in internal, monotone, connected (IMC) node search. This version 
of the problem is the one most relevant to robotics applications which, as already stated, provide our 
main motivation. In general, we try to provide a balanced combination of theory and implementation. 

The main contribution of the paper is the introduction of (several variants of) the node-search 
algorithm GSST {Guaranteed Search by Spanning Tree). This algorithm is presented in detail, theoret- 
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ically motivated and analyzed and also practically evaluated by a number of numerical experiment^ 
The main idea behind the algorithm is that every IMC node search generates a spanning tree of the 
graph; conversely, every spanning tree can be used to perform a node search. Trees can be searched 
much more efficiently than general graphs; GSST exploits this fact to quickly compute a large number 
of searches and then output the one which requires the least number of searcher^ Our experiments 
also indicate that GSST can efficiently node-clear large graphs in a small running time. Prom the 
theoretical point of view, we establish several results about node search and its relation to edge search. 
These results are used to motivate the GSST algorithm and also to prove that some variants of GSST 
are probabilistically complete (i.e., they are guaranteed to find a minimal IMC node clearing schedule 
if given sufficient time). 

The paper is organized as follows. Preliminary concepts and notation are introduced in Section [2j 
edge search and node search are compared in Section [3] IMC node and edge search on trees is studied in 
Section |4j IMC node search on general graphs is studied in Section |5] where the GSST search algorithm 
is also introduced; the algorithm is evaluated by numerical experiments in Section [6j conclusions and 
future research directions appear in Section [7j Some edge search results by Barriere et al. [21 [3] are 
presented in Appendix |Aj the connection of node search to mixed edge search is discussed in Appendix 
[B| connections to pathwidth are discussed in Appendix [Cj an implementation of GSST as an actual 
executable program (which runs on Windows and Linux computers) is discussed in Appendix IdI 



2 Preliminaries 

2.1 Basic concepts and notation 

We denote graphs by boldface letters, e.g. G = {V, E) where V is the node set and E is the edge set. We 
will always label the nodes of G as 1^ = {1,2, iV} (hence the graph contains nodes, i.e. \V\ = N). 
Edges are denoted as {u, v} where u,v £V; often we will write uv instead, for simplicity, but note that 
vu and uv are the same edge (i.e. we study undirected graphs). We only consider connected graphs 
without loops or multiple edges. Also, to avoid some trivial cases, we will always consider graphs with 
at least one edge (and at least two nodes). 

Nodes u, v are said to be neighbors iff uv G E. 

Given a node sequence uiU2...ul, where UiUij^i G E for i = 1,2, L — 1, we say that uiU2...ul is 

1. a path iff n, ^ Uj for i,j £ {1,2, ...jL} and i ^ j; 

2. a cycle iff n, / Uj for i,j £ {1, 2, L} (and i ^ j) except that ui = ul- 

A tree is a connected graph without any cycles. Equivalent definitions are: a tree is a graph for 
which there is a unique path between every pair of nodes; a tree is a connected graph with N nodes 
and — 1 edges. The leaves of the tree are the nodes which have exactly one neighbor. 

A rooted tree is a tree with a distinguished node no called the root of the tree. Given a tree 
T = (y, E), we will denote the same tree rooted at uq by = {V, E, uq). Take any node ul ^ uq and 
let uqUi.-.ul-iUl be the unique path from no to ul', then ul-i is the parent of ul] ul is the child of 
ul-i. Given a node x, its children, their children and so on are the descendants of x. Given a rooted 
tree Tx = {V, E, x) and a node y, consider the node set 

y [y] = {z : z is y or a descendant of y} 

^An implementation of the algorithm (for Windows and Linux computers) is available in the public domain; for details 
see Appendix Id] 

^In fact GSST is an "anytime" algorithm '<A0 . Namely, the longer it runs the better solutions are provided, but a 
reasonable solution becomes available even in the initial stages of the algorithm. 
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and the edge set 

E [y] = {zu : zu £ E and z,u £ V [y]} ; 

note that both V [y] and E[y] are determined by both y and x. The graph (T^ [y] ,E[y]) is a tree (a 
sub-tree of T^^). The rooted tree (V [y] ,E[y], y) will be denoted by T^^ [y]. 

A search schedule on the graph G = (V, E) is a sequence of ordered pairs of nodes: 

S = ((n(l) ,t;(l)) , {ui2),v{2)),...,{u{tfin),v{tfin))), 

subject to {u{t) ,v (t)} G £' for t = 1, 2, We call S (t) = {u (t) ,v{t)) the i-th move (or step) of 
the search schedule. The t-th move can be any one of the following 

1. placing a searcher at node v (in which case u(t) = and v (t) G V) or 

2. sliding a searcher from node u{t) £ V to node v (t) £ V or 

3. removing a searcher from node u (in which case u{t) £ V and v (t) = 0). 

We will also use the more evocative notation S (t) = {u ^ v), i.e. a searcher is moved from node u 
to node v. Moves of the form: — > u (a new searcher is introduced in the graph) and u ^ (a searcher 
is removed from the graph) involve the fictitious "source node" (it is not an element of V), in which 
searchers are kept whenever they are not actively involved in the graph search; the evader does not 
have access to the source node. 

Given a search S, the number of searchers inside the graph at time t will be denoted by sn{S,t). 
The maximum number of searchers used by S will be denoted by sn(S), i.e. 

sn (S) = max sn (S,t) . 

These numbers must not be confused with the search number of a graph, which will be defined in 
Section iH 

Finally, we say that a node u is guarded at time t iff a searcher is located at u (at time t); 
otherwise we say u is unguarded. A path uiU2---ul is called n-unguarded (node unguarded) iff nodes 
Ui {i = 1,2,...,L) are unguarded (at time t); otherwise it is called n-guarded. The path is called e- 
unguarded (edge unguarded) iff nodes Ui {i = 2,3, ...,L — 1) are unguarded (at time t) and e-guarded 
otherwise. It is easy to see that 

UIU2-..UL is e-unguarded U2U3...UL-1 is n-unguarded, 
UIU2---UL is n-unguarded =^ uiU2---ul is e-unguarded. 

The reason for the two different definitions is that the first pertains to node recontamination and the 



second to edge recontamination, as will be seen in Section 2.2 



2.2 Node and edge search 

We repeat the assumptions introduced in Section [T] regarding the evader. Namely, the evader wants to 
avoid capture, he is invisible to the searchers (unless located in the same node), he is always aware of 
the searchers' locations and arbitrarily fast. The net result of all these assumptions is that (in both 
node and edge search) the evader can (and will) always avoid capture if an escape route is available. 
Hence, from the searchers' point of view, we can think of graph search as a process of eliminating 
escape routes. This is expressed as follows: 

1. a node / edge is dirty if it can possibly contain the evader and clear otherwise (e.g., a node 
occupied by a searcher is clear); 
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2. a previously clear node / edge can become dirty (e.g., when a previously guarded path between 
a clear and a dirty node becomes unguarded) - this is called recontamination; 

3. graph search is the process of gradually decreasing the dirty set (of nodes or edges) until it 
becomes the empty set (i.e., the evader has no escape route left). 

This is a worst case approach which essentially eliminates the evader from the graph search, intro- 
ducing in his place the dirty set. We can think of node (edge) search as a one-player "node game" 
("edge game"). In both these games the player controls all searchers, much like pieces in a game of 
chess. We now present the rules of the two games. For reasons which will be explained presently, we 
substitute the terms "clear" and "dirty" with the terms "n-clear" and "n-dirty" (in the node game) 
and "e-clear" and "e-dirty" (in the edge game) . 

Rules of the Node Game 

NO At time t = all nodes are n-dirty and no searcher is located in the graph. 
Nl At every time t = 1,2, ... the player performs one of the following moves: 

Nla place a searcher on a node, 
Nib remove a searcher from a node, 
Nlc slide a searcher along one edge. 

N2 An n-dirty node becomes n-clear when occupied by a searcher. 

N3 An n-clear node u becomes n-dirty when it is connected to to an n-dirty node v by an n-unguarded 
path. 

N4 An edge is n-dirty if it is adjacent to an n-dirty node; otherwise it is clear. 

N5 The game is concluded when all nodes (and consequently also all edges) are n-clear. 

Rules of the Edge Game 

EO At time t = all edges are e-dirty and no searcher is located in the graph. 
El At every time t = 1,2, ... the player performs one of the following moves: 

Ela place a searcher on a node, 
Elb remove a searcher from a node, 
Elc slide a searcher along one edge. 

E2 An e-dirty edge becomes e-clear when a searcher slides along the edge. 

E3 An e-clear edge uv becomes e-dirty when it is connected to to a e-dirty edge xy by an e-unguarded 
path. 

E4 A node is e-dirty if it is unguarded and adjacent to an e-dirty edge; otherwise it is e-clear. 
E5 The game is concluded when all edges (and consequently also all nodes) are e-clear. 
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Remark 2.1. The reason for using "n-clear" and "e-clear" (instead of simply "clear") is that an edge 
can be clear in the node game and dirty in the edge game. An example will illustrate this point. 
Consider the graph of Fig.l and the search schedule ^ 1, ^ 1, 1 ^ 2, 2 — > 4, 4 ^ 3. If this 
schedule is executed in a node game, after the final move all nodes are n-clear and so all edges are also 
n-clear. But in an edge game, after the final move, edge {1,3} has still not been traversed and hence it 
is still e-dirty. More generally, while "n-clear" and "e-clear" refer to similar physical situations, there 
is no a priori reason that their mathematical definitions are equivalent (in Section [4] we will show that 
they are equivalent provided certain conditions are satisfied). 



Figure 1: A graph in which node clearing and edge clearing are not equivalent. For example, placing 
a searcher into node (0 1), then placing a second searcher at 1 and sending him through 2, 4, 3 
(0^1, 1^2,2— >4,4— >3)isnode clearing but not edge clearing. 

Remark 2.2. Note that in the edge game clear edges are "recorded" at the completion of each move. 
This is important to note in the case that an edge is e-cleared and e-dirtied during the same move 
(i.e. for a single value of t). To clarify this, suppose that in the graph of Fig.l the following moves are 
executed: ^ 1 at t = 1; 1 ^ 2 at t = 2. In this case edge {1, 2} is first e-cleared (because traversed) 
and then e-dirtied, both during t = 2. After completion of the second move, edge {1, 2} is e-dirty. 

We now introduce more detailed notation regarding clear/dirty nodes/edges. For the node game 
we will use the following notation. 



We use exactly analogous notation for the edge game, but with an E (rather than N) subscript: (t). 



(t), E<^ (t), E'i (i), G% {t) = {V^ {t) , E^ (t)). Obviously we have 

V^{t)UV^{t) = V, V^{t)nV^{t)=(/}, E'^{t)UE^{t) = E, E^{t)nE§{t) = (!}, 
V^{t)UVi{t) = V, vE{t)r\VE{t)=%, E^{t)UE^{t)=E, E^{t)r\E^{t) = %. 





{u G y : at time t, u is n-clear } ; 

{"U G y : at time u is n-dirty} ; 

{uv G : at time t, uv is n-clear} ; 

{uv £ E : at time t, uv is n-dirty} ; 

(Vyv (^) 1 (^)) is the n-clear graph at time t. 



Remark 2.3. The n-clear / e-clear graph definitions are consistent, i.e. (V^ (t) , E^ (t)) / (V^ (t) , E^ (t)) 
are graphs. More specifically: 
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1. in the node game (t) consists of the n-clear nodes and ah edges between such nodes (these 
are exactly the set (i) ) ; 

2. in the edge game {t) consists of the e-clear edges and none of these can be left "dangling" , i.e. 
if uv G (t) then u,v G {t) (we will prove this as Lemma 
may contain guarded nodes adjacent to e-dirty edges; such nodes may appear as disconnected 
components of G^ (t). 

In the current paper we will concentrate on some restricted versions of the edge and node games. 
More specifically, the restrictions concern the allowable search schedules and they are the following. 



3.1 



in Section pi); in addition {t) 



1. We call a search schedule rooted iff searchers can be placed only into a single, prespecified node 
uq, called the root of the search. 

2. We call a search schedule internal iff searchers once placed into the graph (a) can only move along 
the edges and (b) are never removed from the graph. In other words, "teleporting" (the arbitrary 
movement of a searcher from one node to another, irrespective of the graph connectivity) is not 
allowec|3 

3. In the node game we call a search schedule S monotone iff the clear node set is never decreasing: 
V^' (t) C {t + 1) for all t. In other words, once a node is n-cleared it never becomes n-dirty 
again. The definition is similar for the edge game; i.e., S is monotone in the edge game iff 
E^ (t) CE^{t + 1) for all 

4. In the node game (resp. edge game) we call a search schedule connected iff the clear graph G^ {t) 
(resp. G^ (t)) is connected for t = 1,2, .... 

In Section [3] we will study general, unrestricted graph search; in the remaining sections we will 
concentrate on internal monotone {IM), internal connected (/C) and, especially, internal monotone 
connected {IMC) node / edge search. Our focus originates in our interest in practical pursuit / evasion 
problems which arise in robotics. The nature of the problem dictates the restrictions on the search. 

1. The internality constraint arises from the fact that a robot cannot "teleport"; it can only move 
through rooms and doors connecting these rooms; in the graph model of the physical situation 
this is reflected by the use of edge-sliding moves only. 

2. Similarly, rootedness reflects the fact that usualy an environment has a single entrance through 
which robots must enter. 

3. Connectedness and monotonicity are desirable (but not mandatory) characteristics of robotic 
search. For example, in a hostile environment it is preferrable that the cleared area consists of a 
single connected region, which is easier to guard and control than multiple regions. Similarly, if 
a high cost is associated with clearing an area, it is desirable that this area is cleared only once, 
i.e., that the search is monotone. 

^Since no searchers are ever removed during an internal search S, we have sn (S) — sn (S,t/i„) where tfi„ is the length 
of the search. 

*An important detail is worth mentioning at this point. There is a property, call it strong monotonicity, which 
characterizes schedules in which a traversed edge never becomes e-dirty again. The schedule of Remark |2.2[ is monotone 
(since E% (1) = E'j^ (2) = 0) but not strongly monotone. We stress this detail mostly for the sake of completness; in what 
follows we will not use strong monotonicity. 
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A given graph G with N nodes can always be node-cleared using searchers: just place one 
searcher in every node. Usually G can be cleared with much fewer searchers; the node search number 
of G, denoted by sn (G), is the minimum number of searchers required to node-clear G. Introducing 
additional constraints on the search schedule we get additional search numbers: 

min. nr. of searchers required to clear G with internal node search schedule; 

-//- IM node search schedule; 

-//- I C node search schedule; 

-//- IMC node search schedule. 

For the edge game we define the corresponding edge search numbers: se{G), s^(G), s^(G), 
si(G), sr(G). 

In case of a search rooted at node x, we will modify the notation as follows: sn (G; x) is the 
minimum number of searchers required to node-clear G by a search rooted at x; similarly for se (G;x), 

(G;x) and so on. If the graph G is a rooted tree T^;, then we will assume it is always searched by 
rooted searches; hence se (T^x) = se (T;x), (T^^) = (T;3;) and so on. 

A minimal node search schedule is one which clears G using sn (G) searchers; a minimal IM node 
search schedule is an IM schedule which clears G using sj^ (G) searchers and similarly for all other 
types of either node or edge search. 



shiG) : 

s%{G) : 
s'r (G) : 



3 Comparison of node and edge search 

In this section we compare "unrestricted" edge and node search. In other words, we compare the 
outcome of a search schedule S when it is used first in a node game and then in edge game, both played 
on the same graph G and without requiring rootedness, internality, monotonicity or connectedness. 



Our main result is Theorem 3.3 which, informally, says the following: every search schedule S gives 
at least as good results in the node game as in the edge game. More precisely, at every step t of the 
search, the set of e-clear nodes (and edges) is a subset of the set of n-clear nodes (and edges). An 
immediate corollary is that an edge clearing search schedule is also node clearing; The converse does 



not hold, as we have already seen by the example of Remark 2.1 
To prove Theorem |3.3| we will need the following lemmas. 



Lemma 3.1. Given a graph G and a search schedule S, for t = 0, 1, 2, ... we have 

ueVE (t) (Vux € E:uxeE^ (t)) (1) 
uv e (t) ^ {u,vG it)) (2) 

In other words: if a node u is e-dirty, then all edges ux, adjacent to u, are also e-dirty; and if an edge 
uv is e-clear, then nodes u,v are e-clear. 

Proof. For ([T]): since u is e-dirty, it must be unguarded. We distinguish two cases. 

(l.i) Suppose u is adjacent to a single edge, call it uv. If uv is e-clear then u is not adjacent to any 

e-dirty edges and hence must be e-clear, which is contrary to the original assumption. Hence uv is 

e-dirty. 

(l.ii) On the other hand, if u is adjacent to more than one edges, they cannot all be e-clear (then 
u would also be e-clear); so at least one edge, call it uw, is e-dirty. Suppose there is also an e-clear 
edge, call it uv. But then the e-unguarded path wuv connects e-dirty wu to e-clear uv, which leads to 
contradiction. Hence all edges adjacent to u are e-dirty. 

For suppose u e (t). Then u is unguarded and there exists some ux £ (t) and x ^ v. 
But the path xuv is e-unguarded and so uv G E^ (t) which is a contradiction. □ 
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Lemma 3.2. // a node u is e-cleared exactly at time t, then u is entered by a searcher at t. 

Proof. Suppose u £ V£ {t - l)nV^ (t). This can happen in two ways, (i) u was unguarded and e-dirty 
at t — 1 and then entered by a searcher at t. This is exactly the case described by the theorem, (ii) The 
other way for u to be cleared exactly at t is if n was unguarded and e-dirty at t — 1 and in addition a 
single edge, call it uxq, was e-dirty at t — 1 and e-cleared at t (while all other edges ux were e-clear at 
t — 1 and remained e-clear at t). For this to happen, the move at time t is either u — > xq or xq — > u. 
But u ^ Xq is not possible, because then u would be guarded and e-clear at t — 1. Hence the move at 
t is Xq ^ u and u was entered at t (which shows that (ii) is a sub-case of (i)). □ 

Now we are ready to prove the following. 

Theorem 3.3. Given a graph G and a search schedule S, we have 

for t = 0, 1,2, ... : (t) C (t) and (t) C (t) . (3) 

Proof. We will actually show the relationships (equivalent to (|3]) ) 

Vj^ (t) C V£ {t) , E§ it) C {t) . 

The proof is by induction. At s = all nodes are dirty in both the edge game and the node game, 
hence (0) = (0) = V. Suppose (s) C (s) for s = 1, 2, t. Now we take s = t + 1 and 
consider first the cleared and then the recontaminated nodes. 



I. Cleared Nodes. By Lemma |3.2[ a node becomes e-cleared exactly at t + 1 only if entered at t + 1. 
Hence at most one such node (call it xq) exists and, since entered at t -|- 1, it is also n-clear at 
t+1. In other words: xq G (t) n (i -h 1) ^ xq G V)^ {t + 1). 

II. Recontaminated Nodes. Denote by A the set of nodes recontaminated (in the node game) exactly 
at t -|- 1; i.e., A = (t) n {t + 1). By the definition of n-dirty node, every u A must have 
a path (n-unguarded at t -|- 1) to a node v S (t -|- 1). Furthermore, at least one u £ A must 
have a path (n-unguarded at t -|- 1) to a node v S Vj^ {t) (if this did not hold for any u £ A, then 
none of the nodes of A would get recontaminated to begin with) and in fact this shows every 
u A must have a path (n-unguarded at f -|- 1) to a node v G (t). So take any u A, then 
there exists a path viV2. -.vku satisfying: 

VI e (t) nv^{t + i), (4) 

viV2-.. vku is n-unguarded at t -|- 1. (5) 

Now we will show that vi must belong to {t)nV^ {t + 1) as well. First, vi e (t)C (t). 
Second, if ^ V£ {t + I) then vi was cleared exactly at t + 1, which (by Lemma 3.2 h means 



a searcher entered vi at t -|- 1; but then vi ^ {t + 1) which is contrary to ([4|. Hence vi G 



(*) ^ + -'-)• Since vi is e-dirty at time t -|- 1, viV2 is e-dirty at t -|- 1 by Lemma 3.1 Also 

viV2.. -Vku is e-unguarded at t -|- 1. Hence u G (t -|- 1). Since this is true for every u ^ A, we 
conclude 

ACVi{t + l). (6) 

If the move at t + 1 was placing or sliding a searcher into node xq, let B = {xq}; else let i? = 0. 
Since (t) C (t) we have 

V^{t)-BCVi{t)-B. (7) 

Since the e-dirty nodes at t + 1 include all the e-dirty nodes at t except the node xq (which was e-cleared 
at t -|- 1), we have 

Vi{t)-BCVi{t + l). (8) 
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And so we have 

vSit)-B^vE{t + l). (9) 

The n-dirty nodes at t + 1 are exactly the n-dirty nodes at t, excluding nodes n-cleared at t + 1, plus 
the nodes recontaminated at t + 1. In other words 

vS{t + l) = {vS{t)-B)^A (10) 

Combining eqs. ([6|, Q and ([To]) we get 

Vl^ {t + l) = {Vff {t) - B)^ A<^VE {t + l) . (11) 

Hence by induction we get {t) C (t) (and (t) C (t)) for t = 0,l, 2, ... . 

Let us now consider the edge sets (t) and (t). Take any t and any e dge u v € E^ (t). Without 



loss of generality we can assume u G (t) C {t) and hence, by Lemma 3.1 uv G E^ (t). Hence, 
for t = 0, 1,2, we have 

E§{t)QE^{t) and E'i (t) Q E^ (t) , 

which completes the proof. □ 

Corollary 3.4. Given a graph G and an internal search schedule S, if S is edge clearing it is also 
node clearing. 

Proof. Suppose the final move of S took place at time tfm, then E^{tfin) = 0. Since E^{tfin) C 
E^ {tfin) = we have E^ {tfin) = which also means {tjin) = (if a node u was n-dirty, an 
adjacent edge uv would also be n-dirty). □ 



Remark 3.5. From Theorem |3.3| we see that edge search is "weaker" than node search, i.e., for every 
search schedule, and at every time step, the clear set in the edge game is smaller or equal than the one 
in the node game. There is a variant of edge search, the so-called mixed edge search which, as we will 
see in Appendix [Bj is equivalent to node search, i.e. every search schedule produces the same clear and 



dirty sets at every step of both the edge and node game (this is stated and proved as Theorem B.l). 



From this fact follow two consequences. First, node search is NP-complete (since mixed edge search 



is NP-complete [S]). Second, Theorem 3.3 can be obtained as a corollary of Theorem B.l A more 



detailed discussion of these issues appears in Appendix |B} 

4 Search on Trees 

In this section we will focus on rooted IMC (internal, monotone, connected) search on trees and we 
will compare node and edge search in this context. 

Several authors have studied rooted IMC edge search [21 El [TTl [TJl [TJl . Our main results in this 



section are Theorems 4.1 and |4. 2 [ which establish the equivalence of rooted IMC node and edge search 
on trees. Hence several results already established for edge search also hold for node search (on trees) 
as will be seen in the sequel. 

Theorem 4.1. Given a tree T and an internal rooted search schedule S, if S is monotone connected 
in the node game then 

for t = 0, 1,2, ... : (t) = (t) and E^ (t) = E% (t) 

and S is monotone connected in the edge game. 
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Proof. The proof is inductive. We have (0) = (0) = and E'^ (0) = E'^ (0) = 0. Also, since 
the search is rooted and internal, the move at t = 1 will be the placement of a searcher into the root 
uo; hence (1) = (1) = {uq} and E^ (1) = E^j^ (1) = 0. Now suppose that at time t > 1 we have 

V^{t) = V^{t), E^{t)=E'^{t) (12) 

and let us consider the move u — > u , performed at t + 1, t > 1. 

We first dispose of the case where the move involves the node 0. Because of internality, v ^ 
(no searcher is removed from the graph). If the move is — > by rootedness we must have v = uq. 
Also by rootedness, uq is the first node cleared (at some t = 1). Now, in the node game: (a) by 
monotonicity, no node is recontaminated at t + 1 and (b) no node is n-cleared either (since uq remains 
n-clear, the move uq implies (t + 1) — (t) = 0); hence {t) = {t + 1) which also means 
E^ (t) = E^ (t + 1). In the edge game, — > -uq (a) does not clear any new edges (so it does not clear 
any new nodes either) and (b) does not remove any searchers from any node, so neither edge nor 
node recontamination is possible; hence (t) = (^ + 1) and E^ (t) = E^ {t + 1). In short, if the 



{t + l)-th move involves node 0, from (12) follows 



V^{t + l) = vS{t + l), E^{t + l)=E^{t + l). 

Next we examine the case where the {t + l)-th move is u ^ v, with u,v ^ 0. We will examine the 
effects of this move separately for the node and edge game. 

I. Node Game, u was guarded at time t, so G {t). Suppose, first, that at t only one searcher 
was located on u, so u is unguarded at t + 1. However, since S was supposed monotone in the node 
game, there is no node x G (t) n (t + 1) and, especially, u remains n-clear. Since no node has 
become n-dirty at i + 1, no edge has become n-dirty either (an edge becomes n-dirty iff incident on an 
n-dirty node). As for clearings, v is n-clear at t -|- 1, since entered by a searcher; and uv is also n-clear 
since {u,v} C {t + 1). No node other than v was entered at t -|- 1, so no node other than v was 
n-cleared. Is there an edge (other than uv) which was n-cleared exactly at t -|- 1? This is only possible 
if (a) the edge has the form vxi and (b) v E Vj^ (t) n {t + 1) {v became n-clear exactly at t -|- 1) and 
(c) xi £ (t) n {t + 1). Suppose xi ^ u. Since (t) is connected, there is a path uxl-i-..xi 
which is inside (t) and hence does not contain v. Then the tree T contains a cycle uxl^i...xivu 
which is a contradiction. Hence the only edge possibly n-cleared at t -|- 1 (if not already n-clear at t) is 
uv. In short 

(t + 1) = (t) U {v} and E'^ {t + 1) = E% (t) U {uv} . (13) 

The case that u is still guarded at t -|- 1 (i.e. u contained more than one searcher at time t) is omitted, 
since it is similar but easier to treat than the case of unguarded u. 

II. Edge Game. Again, we first examine the case that only one searcher was located in u at time t. So 
u was guarded (and e-clear) at t and is unguarded at t + 1. We will now show that u remains e-clear 
at t -|- 1; in other words that 

uev^ (t) nv^{t + i). (14) 

Suppose, on the contrary, that we have 

uevE (t) n (t + 1) ; (15) 



There are only two ways for (15) to hold, each of which we examine separately. 



II. 1 3x1 / V such that uxi G E^ (t) D E^ {t + 1). Then uxi £ E^ (t) = E^ (t). Since the move 
at t -|- 1 was u —>■ V, u was guarded at t and xi £ Vj^ (t). Now, from u — > t> and xi v we 
get xi G Vj^ (t -|- 1); from this and u unguarded at t -|- 1 we conclude that u G {t + 1) which 
contradicts monotonicity of the node search. 
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II. 2 Alternatively, there exists a path uxi...xk-iXk (with K > 1) such that xkxr-i S {t) n 
E^{t + 1) and uxi...xk-iXk is e-unguarded at t + 1. First note that, since uxi...xk is e- 
unguarded at t + 1 and the {t + l)-th move \s u ^ v, it follows that v ^ {xi, xx-i}; and since 
T is a tree, v / xk too; in short f ^ {xi, Xi^}. Also n ^ {xi, Xi^-}, since uxi...xk is a path. 
Now 

the {t + 1) -th move is u — > v 1 

u ^ {xi, ...,xk} > =^ wxi-.-Xft- is e-unguarded at t 

UXI...XK is e-unguarded at t -|- 1 J 

and this together with xkxr-i £ (t) implies xk-i £ (t) = (t). From v ^ xk-i 
follows that xk-1 was not entered at 1 and so xk-i S {t + 1). From this and uxi...xk-i 
being n-unguarded at t -|- 1 we conclude that u E + 1) which contradicts monotonicity of 

the node search. 



In short, u cannot become e-dirty at i -|- 1, i.e., we have proved (14). Using this we now will show 



that no previously e-clear edge xy can become e-dirty at t -|- 1 . Because this would require the existence 
of some edge pq G (t) H {t + 1) and a path xy...pq which is e-unguarded at t -|- 1 but e-guarded 
at t (otherwise xy would already be e-dirty at t). But any such path would have to contain u (no 
searcher was removed from any other node at t + 1) which would in turn imply u € {t + 1), which 
contradicts (14). Since no edge is recontaminated no node is recontaminated either (the only node 



guarded at t and unguarded at t -|- 1 is n, for which we have ( 14 ) 



On the other hand, the only edge possibly e-cleared at time t + 1 (if not already e-clear at t) is uv, 
since no other edge was traversed at t + 1. Hence the only node possibly e-cleared at time t + 1 (if not 
already e-clear at t) is v. 

In short we have shown 



(t + 1) = (t) U {v} and E^ {t + 1) 



E'i (t) U {uv} . 



(16) 



The treatment of the case that u is still guarded at t -|- 1 (i.e., at t it contained more than one searcher) 



is omitted, since it is similar but easier to the case of u unguarded. From ( 12 ) , ( 13 ) and ( 16 ) we obtain 



{t + l) = (t + 1) and E^ (t + 1) = E% {t + 1) (17) 

and, proceeding inductively, we obtain the required result: 

for t = 0, 1,2, ... : {t) = {t) and E^ {t) = E% {£) (18) 

Monotonicity and connectedness in the edge game follow from the fact that these hold in the node 
game and from eq.(18). □ 

Theorem 4.2. Given a tree T and an internal rooted search schedule S with root uq, ifS is monotone 
connected in the edge game and satisfies either of the following constraints 

CI E^ (1) = 0, E'^ (2) = {uov} , 

C2 E'^ (1) = 0, E<i (2) = 0, E'^ (3) = {uov}, 

then 

for t = 0, 1,2, ... : (t) = (t) and E^ {t) = E% {t) (19) 

and S is monotone connected in the node game. 
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Remark 4.3. Before proving the theorem, let us discuss the significance of constraints CI and C2. 
Three remarks must be made 



Regarding the kind of searches which will satisfy either of the constraints: CI will be satisfied by 
searches with S (1) = ^ uq, S (2) = no — > w (provided uq has a single neighbor, namely v); C2 
will be satisfied by searches with S (1) = ^ uq, S (2) = — > uq, S (3) = uq ^ v; in both cases 
Uq is, obviously, the root of the search. 



2. The constraints are imposed to exclude situations similar to the one discussed in Remark 2.2 
where an edge is e-cleared and e-dirtied during the same time step t. 

3. Finally, CI and C2 are not exceedingly restrictive; as will be seen later, every "interesting" IMC 
search on a tree either satisfies CI / C2 or can easily be converted to an equivalent search which 
does. 



Proof of Theorem The theorem will be proved by induction. It is easy to check that 

for t G [0, h] : (t) = (t) and (t) = (t) (20) 

with ti = 2 when CI holds and ti = 3 when C2 holds. Now suppose that at time t > ti we have 

V^it) = V^it), E%[t)=E%(t) (21) 

and consider the move performed at t + 1. 

We first dispose of the case where the move involves the node 0. Because of internality, v ^ {} (no 
searcher is removed from the graph). If the move is ^ f , by rootedness we must have v = uq. By CI 
and C2, uq is the first node cleared (at t = 1). In the edge game, at t + 1, no edge is recontaminated 
(by monotonicity) and no edge is e-cleared (since no edge is traversed); hence E^ (t) = E^ {t + 1); 
since no searcher is removed from a node, {t) = {t + 1) as well. In the node game, uq is already 
n-clear at ti and no other node is entered at i + 1, so no node is n-cleared; no searcher is removed from 
the graph, so no node is recontaminated; hence (t) = V^' {t + 1) which implies E^ (t) = E^ {t + 1) 



as well. In short, if the (t + l)-th move involves node 0, from (21 ) and the above arguments follows 



V^(t + l) = vS{t + l), E^it + l)=E^it + l). 

Next we examine the case where the {t + l)-th move is u — > u, with u,v ^ 0. We will examine the 
effects of this move separately for the node and edge game. 

I. Edge Game. By monotonicity of the search, no previously e-clear edge becomes e-dirty at t + 1; and 
hence the only node that can possibly become e-dirty is u. We will show however that u also remains 
e-clear. Since E^ (ti) = {uqv} and t > ti, from edge monotonicity follows that E^ (t) ^ for all 
t>t\. And, since u € (t) = (t) and Gg (t) is connected, there exists uz G E^ (t); if u becomes 
e-dirty at t + 1, then uz G E^ {t -\- 1), which contradicts monotonicity of the edge search. Hence no 
previously e-clear node becomes e-dirty at t -|- 1 . 

Regarding new clearings, edge uv and node v may become e-clear at t -|- 1 (if they were not already 
e-clear at t). No edge other than uv was traversed, hence no edge other than uv (and no node other 
than v) can be e-cleared at t + 1. 

In short, in the edge game we have 

{t + l) = VE (t) U {v} and E^ {t + 1) = E^ (t) U {uv} . (22) 

II. Node Game. Again, we first examine the case that only one searcher was located on u at time t. 
So u was guarded (hence n-clear) at t and is unguarded at t + 1. We will now show that u remains 
n-clear at t -|- 1; in other words that 

uev^ (t) nv^it + i). (23) 
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Suppose on the contrary that we have 

u£V^ (t) r^vS{t + l)■ (24) 



(24) can happen in only two ways, each of which we examine separately. 



11.1 3x1 / V such that uxi E E and xi G {t) r\V^ {t + 1). Then xi G (t) = {t) =^ uxi G 

(t) and hence (since the move at t + 1 was u — > f and xi 7^ uxi G i?^ (t + 1). 

Since E^ (t) / (for all t > h) and u G V)^ (t) = (t) and (t) is connected, there 
exists uz G E^{t). However, since uxi G £:|'(t + 1) and u is unguarded at t + 1, we conclude 
G E^ (t + 1), which contradicts monotonicity of the edge search. 

11. 2 Alternatively, there exists a path uxi...xk-iXk (with K > 1) such that xk G (t) nV;^ {t + 1) 
and UX1...XK-1XK is n-unguarded at t + 1 (so v ^ {u, xi, x^:}). First note that, since uxi...xk 
is a path, li ^ {xi, Xi^}. Now 

the {t + 1) -th move \s u ^ v \ 

u ^ {xi, ...,xk} > ^ uxi...Xii' is e-unguarded at t 

UXI...XK is n-unguarded at t + 1 J 

Since xk G (t) = {t) it follows that xk-iXk G -E'^ (i); since also xk~iXk is not traversed 
at t + 1, it follows that xk-ixk G E^ {t + 1). By the same reasoning as in the previous case, 
there exists uz G E^ (t) and, since uxi G E^ (t + 1) and u is unguarded at i + 1, we conclude 
uz G (t + 1), which contradicts the edge monotonicity of S. 



In short, u cannot become n-dirty at t+1, i.e., we have proved (23 ). Using this fact and an argument 



similar to that of Theorem 4.1 we conclude that no previously n-clear node can become n-dirty at 
t + 1. Since no node is recontaminated, no edge is recontaminated either. 

On the other hand, the only node possibly n-cleared at time t + 1 is v (no other node was entered) 
and hence the only edge possibly n-cleared at time t + 1 is uv (if another edge was cleared, by an 



analysis similar to that of Theorem 4.1 , we conclude that a cycle must exist in T, which is impossible). 
Hence the only edge possibly n-cleared at t -|- 1 is uf. 
In short, in the node game we have 

{t+l)= (t) U {v} and E'^ {t + 1) = E% (t) U {uv} . (25) 

The treatment of the case that u is still guarded at t -|- 1 (i.e., it contained more than one searcher at 



t) is omitted, since it is similar to but easier than the case of u unguarded. From (21), (22) and (25) 
we obtain 

{t + l) = {t + 1) and E'i {t + 1) = E% {t + 1) (26) 
and, proceeding inductively, we obtain the required result 

for t = 0, 1,2, ... : (t) = V^' (t) and E^ (t) = E^ (t) . (27) 

Monotonicity and connectedness in the node game follow from the fact that these hold in the edge 



game and eq.(27). □ 



We will now review some results (obtained by Barriere et al. [11 13]) regarding IMC edge search on 
trees and will show that similar results hold for IMC node search on trees. In the following presentation 
the terminology and notation of [21 13] is somewhat changed, to conform with the one used in the current 
paper. 

The first basic result of [2] is the following. 



14 



Theorem 4.4. /^/ For every tree T there is an IMC edge clearing search schedule S which uses (T) 
searchers. Moreover, in S all searchers are initially placed at the same node uq (i.e., the search schedule 
is rooted) and the first step (after placing the searchers) consists in clearing an edge incident to uq. 

Corollary 4.5. For every tree T we have (T) = s^^(T). There exists a rooted IMC search 
schedule which achieves this bound. 



Theorem 4.4 can be extended to the following sequence of inequalities [3]. 

Theorem 4.6. J^/ For every tree T we have. 

SE (T) = (T) = (T) < st (T) = s% (T) = (T) = sT (T) = st' (T) < 2se (T) - 2. (28) 

Furthermore, there are trees T for which the inequality s'^ (T) < (T) is strict. 

Barriere et al. also present the Search algorithm [2], which computes a minimal edge-clearing 
rooted IMC schedule S for every tree T. The next Theorem shows that S is also a minimal node- 
clearing rooted IMC schedule. 

Theorem 4.7. For every tree T, the search S produced by the Search algorithm is 

1. a minimal edge-clearing IMC search o/T, which is also rooted; 

2. a minimal node-clearing IMC search of T, which is also rooted. 

Proof. Part 1 of the Theorem is proved as Lemma 9 in [2^. Let us now prove part 2. Since S is an 
edge-clearing strategy it is also node-clearing. We next show it is minimal. 

Let us first consider the case of trees T such that s^*^ (T) = 1. It is easy to see that such trees 
are paths of the form uiU2..-U]\f, and Search produces the "obvious" edge clearing schedule: — > ui, 
ui — > U2, ... (or the reverse, starting at ujv) which is also a rooted minimal node clearing IMC schedule. 

Suppose now that s^"^ (T) > 2. By part 1, Search will produce a rooted minimal edge clearing 
IMC schedule S. Also, by relaxing the requirement that all searchers are placed into the graph in the 
first move (in other words, by interspersing searcher placements with edge slidings) we can obtain from 
S a new search schedule S' which 

1. uses the same number of searchers as S (i.e., sn(S) = sn(S')); 
1. is rooted (i.e. all searcher placements are into uq); 

3. satisfies either CI or C2 of Theorem 14.21 



4. produces the same sequence of clear graphs (t) as S (with a slight time adjustment, to 
account for the changes in searcher placement times). 



Hence S' satisfies the conditions of Theorem 4.2 and is IMC in the edge game which means that, 
using S', we have 



for t = 0, 1,2, ... : {t) = (t) and E'^ {t) = E% (t) . (29) 

From this also follows that S' is a node clearing rooted IMC search schedule of T. 

Now take another node clearing IMC search of T, call it S", which is minimal for the node game 



(i.e., sn(S") = s^'^(T)). Then, by Theorem 4.1, S" is also edge clearing, hence 
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On the other hand, since S' is node clearing, 



sn 



(S") = s'S"' (T) < sn (S') = sn (S) . 



Hence 



which completes the proof. □ 
Corollary 4.8. For every tree T we have s^j^"" (T) = (T). 



Remark 4.9. Theorem 4.7 shows that the Barriere Search algorithm can be used to compute a rooted 
IMC node clearing schedule for every tree T. How good is such a schedule? In other words, can we 
node clear a tree T with fewer than s^'^ (T) searchers? We can certainly do this with an internal 
(non-monotone, non-connected) search. For example, let T^'^*^) be the complete binary tree of height 
2M; then s% (T^^a^)) = M, while s'^" (t(2A^)) = 2M; but we do not know the values of s*^ (T^^*^)) 



and s*^ (t(2*^)). More generally we do not have the analog for node search of the inequalities fcSh. It 
is easy to see that 



sn{G) = s%{G)<s]^{G)< <sT{G) = sW^G) (30) 



for every graph G and, in particular, for every tree T (and the inequalities in (30) can be strict 



Refining (30) to something like (28) is a subject of our future research. The question is of special 
interest to us because, as already mentioned, the one absolute requirement for robotic pursuit / evasion 
is internality; monotonicity and connectedness are desirable but not indispensable. 

Remark 4.10. A basic component of Barriere's Search algorithm is the "A labeling" of edges, per- 
formed by the auxiliary Label algorithm [2 . As will be seen in the next section, the Barriere A labels 
are also used by our GSST algorithm to node-clear general graphs. 



5 Search on Graphs 

We now turn to the study of rooted IMC node searches of an arbitrary graph G = {V, E) (which will 
always be assumed to have nodes, i.e., \V\ = N). In every such search, the t-th move has the form 
S (t) = u ^ V, where u,v £ V or, perhaps, u = 0, the "source" node. 

We present several variants of a basic node clearing algorithm. All variants are guaranteed to 
find a node clearing search schedule; we prove that two of the variants will find a minimal schedule 
with probability 1 — a^^ where a G (0, 1) and M is the number of iterations of the algorithm; even 
non-minimal search schedules require a reasonably small number of searchers, as will be seen by the 
experiments of Section [6} 

5.1 Motivation 

Our basic algorithmic idea is motivated by the following rather simple observation: every rooted IMC 
node-clearing search of G generates a spanning tree. This observation can be refined in the form of 
the following theorem. 

Theorem 5.1. Given a graph G = (y,E) and a rooted IMC node clearing search S o/G. The clearing 
moves o/S generate a sequence of trees, (Tq, Ti, T2, T^v), where (for n = 1,2, N) T„ = (Vn, En) 
and the following hold: 

Dl To is the empty graph (% = Eq = 0J; 
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D2 Tjv is a spanning tree of G (Vn = V , ^ E); 

D3 for n = 1,2,...,A^; Vn^i ^ Vn, -En-i ^ ('m ot/ier words Tn-i is a subtree ofTn); 

D4 for n = 1,2, N : Vn = and for n = 2,3, En = En-iD{uiUn} , with i £ [l,n — 1]. 



Proof. Inductively. Since S is monotone, it involves N clearing moves. Tq is the empty graph. Ti is 
formed by the first move of S, which consists in placing a searcher at the root node. So Ti is the tree 
with a single node and trivially has Tq as a subgraph. Suppose D3 and D4 hold up to m = n and 
consider the (n + l)-th clearing move of S. Since S is connected, we add to T„ one node n„,_|_i and 



(as explained in the Proof of Theorem 4.1) exactly one edge UiUn+i (with i G [l,n\) to obtain a new 
tree Tn+i (of n + 1 nodes and n edges) which also satisfies D3 and D4. Hence D3 and D4 hold for 
m = 1,2, N. At m = N, Vn contains N nodes, hence Vn = V; since T^v is a tree, it is a spanning 
tree of G. □ 



Remark 5.2. An abbreviated statement of Theorem 5.1 could be: "Every rooted IMC node clearing 
search of G specifies a spanning tree of G and an order of clearing the nodes" . But the order must be 
consistent with the edge structure of the graph and the spanning tree (e.g., we cannot use an order of 
node clearing which requires non-existent edges). This consistence is exactly what conditions D3 and 
D4 describe. 



Theorem 5.1 gives the motivation for our algorithm GSST {Guaranteed Search with Spanning 



Trees), which is informally described below. 



Algorithm 1 GSST, Informal Description 
Input: Graph G 

Select a spanning tree T of G and a root uq of T 
Find a rooted IMC node-clearing search S' of T 
Apply S to G 

if at some step of S' a move u ^ v would result in recontamination in G then 

send a "guard" to u 

execute u — > u 
end if 

Output: The search S obtained by combining S' with the guard moves 



Note that, by construction, all clearing moves take place along edges of the spanning tree T. 

In the above description we have used the term "guard". Stated informally, the searchers play 
two roles: "iree searchers'"^ perform the clearing moves, always along the edges of the spanning tree; 
^^guards" are stationary and block potential recontamination paths. However, note that a particular 
searcher can change roles during the course of the search. 

The main advantage of the GSST algorithm is that it is fast. A random spanning tree T can be 
quickly generated and searched. Since T and G have the same node set, node-clearing T in a node- 
monotone manner results in node-clearing G as well. The main issue is: how many guards will be 
required to block recontamination through non-tree edges of Gl The number usuallly turns out to be 
quite reasonable, because (a) guards can be reused and (b) tree searchers can also be used as guards 
when they do not perform clearing moves. 

Both the labeling and the traversal phase of GSST can be executed in either a centralized or 
distributed manner; the latter is useful for robotic applications, where each robot can share some of the 
computational load. In the distributed implementation, all searchers share the underlying spanning tree 
and labeling. When a searcher reaches a node, he checks to see if he can move without recontamination. 
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If he can, he determines his next move based on the traversal strategy, and he shares this move with 
the team. 

Hence GSST has short execution time and can be run repeatedly (in reasonable time), using many 
different (randomly selected) spanning trees. Our algorithm depends on the quick discovery of a 



spanning tree corresponding to a minimal or near-minimal search (Theorem 5.1). Our experiments in 
Section |6] show that GSST discovers near-minimal searches in only a short time for several families of 
complex graphs. 

An important characteristic of the GSST algorithm is its "anytime-ness" . Anytime algorithms re- 
turn a partial answer before completion and they keep providing improved answers, the improvement 
increasing with computation time HQ]. GSST has these characteristics, as will be understood in Section 



5.2 Namely, GSST is characterized by monotonicity (the solution only improves over time), recog- 
nizable quality (the quality of the solution, i.e. number of searchers, can be determined at run time), 
consistency (the algorithm will not spend too much time finding a single solution), and interruptibility. 

5.2 The GSST Algorithms 



A detailed description of the GSST algorithm is given by the following pseudocode on p,19 The 
notation S = S| (n ^ w) means that the move (n — > v) is appended to the previously determined 
search schedule S (i.e., becomes the next move of S). Several subroutines appearing in the following 
listing will be discussed presently. 

The following remarks explain the operation of the algorithm. 

1. Generate a random spanning tree T by GenerateTree. We have used two different methods of 
random spanning tree generation. 

(a) The uniform method is an implementation of Wilson's f37] loop erased random walk algo- 
rithm. 

(b) The DFS method selects a root node and randomly moves down the tree in a depth-first 
manner. At each node, a random incident edge is chosen and set as an edge in the spanning 
tree. A visited list is maintained, and when a node is visited more than once, the edge used 
to reach it the second time is set as a non-tree edge. This eliminates cycles in the graph and 
thus generates a tree. When a leaf is reached, the algorithm recurses to ensure that all nodes 
are included in the tree (i.e, it is a spanning tree of the original graph). The motivation for 
this method is to bias towards spanning trees that require fewer guards. The intuition is 
that a DFS traversal will generate only a few nodes with non-tree edges, thus leading to few 
required guards. 

2. Label the edges of T by the R-Label algorithm (this, a modification of Barriere's Label algo- 
rithm, is listed and discussed in Appendix [A|) . 

3. While n-dirty nodes still exist, select an n-dirty edge uv of T (by SelectEdge, to be discussed 
presently). Let a searcher traverse uv if this does not cause node recontaminatioiij^ if no such 
searcher exists, then use a new searcher (originally placed at the root) to traverse uv. 

4. Repeat the process until all nodes are n-cleared. (Hence the algorithm does not allow node 
recontamination and will generate as many searchers as necessary to prevent it). 

5. When all nodes have been n-cleared, a rooted IMC node clearing search has been generated, 
which performs all its clearing moves along the edges of T. 

^The line "Move that searcher to u staying inside the clear graph" is actually a simplification, i.e., in the interest of 
brevity, we do not indicate how such a path is obtained; however this is always possible, usually in more than one ways. 
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Algorithm 2 GSST 



Input: G: a graph; M: no. of spanning trees to use. 

Smm — 

for m = 1 : M do 

S = 

T= GenerateTree ( G) 

Randomly choose root uq 
S = S|(0^7Xo) 

= {t/o}, i?^ = 
VS = V- {no}, i?^ = ii^ 



A=R-Label(T) 
while / do 

TO=SelectEdge(G^, T, E^, A) 

if a searcher can traverse uv without recontamination then 
Move that searcher to u staying inside the clear graph 
S = S\{u^v) 

Vc = V§U {v], E'^ = Ec^ {uv] 

Gat = {Vn > -^at) 
else 

S = S|(0^no) 
end if 
end while 

if sn{S) < 

Smin then 

Smin — S?T.(S) 

end if 
end for 

Output: Node clearing schedule Smm- 



6. Go back to step 1 and repeat the process with a new spanning tree. 

7. After the maximum number of trees and (corresponding searches) has been generated, return a 
search Smin which attains Smin, the minimum value of sn(S). 

The subroutine Select Edge chooses an n-dirty edge uv belonging to T and adjacent to the current 
clear graph G^. There are several ways to perform this selection. 

1. Labeled Selection (L). Select the next edge uv of T to be traversed according to the Barriere 
A labels (as in algorithm R-Search, presented in Appendix [A|) ; however, if traversing uv would 
cause recontamination then select the next edge in the Barriere sequence; if, at some stage of 
the search, traversing any edge of T would cause recontamination (i.e., if all searchers are stuck) 
then introduce a new searcher at the root. 

2. Labeled Selection with Randomized Tie-breaking (LR). Same as the previous except that 
ties of edge labels are broken randomly. 
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3. Randomized Selection (R). Choose uv randomly (without making any use of the Barriere 
labeling) by a uniform probability on all n-dirty edges belonging to T and adjacent to G^: 



Pr (uv) 



c if u is n-clear and uv is n-dirty and an edge of T; 
else. 



4. Labeled Weighted Selection (LW). This rule is intermediate between R and L: edge selection 
is still random but, instead of a uniform probability distribution, the probability of an edge uv 
being selected is inversely proportional to its Barriere A label. 

5. Label Dominated Selection (LD). This can be done by labeling edges that lead to parts 
of the graph that are trees (subtrees of the graph). A list of searchers who can move without 
recontamination can be maintained during search. If an edge adjacent to (t) leads to a subtree 
of the graph, and enough free searchers are available, clearing this subtree can only improve the 
search strategies. 

By using each of the above rules in the "basic" GSST algorithm, we obtain ten GSST variants: 
uniform GSST-L, uniform GSST-LR, ... , uniform GSST-LD, DFS GSST-L, DFS GSST-LD. These 
variants (except for the two GSST-R's) utilize the Barriere labeling, but do not necessarily produce a 
Barriere traversal of the spanning In a sense Randomized Selection is the simplest or most naive 

rule that can be used to select the next move of the search schedule: every n-dirty edge of T (adjacent 
to the clear graph) is equally likely to be selected. The remaining three rules can be understood as 
ways to bias the probability by which edges are selected in some meaningful way. The effectiveness of 
these rules will be judged by the experiments of Section [6| From the theoretical point of view, we will 
show in Section [5. 3| that the uniform GSST-R and uniform GSST-LD will find a minimal node clearing 
schedule with probability 1 — a^'^ where M is the number of iterations and a G (0, 1); we conjecture 
that this property does not hold for Barriere selection. As a practical matter, the issue is how large M 
has to be for 1 — a*^ to be sufficiently close to 1. However, the experiments of Section [g] indicate that 
the above rules find good search schedules in very reasonable time. 

Variants of GSST can also be produced by replacing the GenerateTree subroutine with an ex- 
haustive generation of all spanning trees of G (it can be used in conjunction with any of the above 
variants of SelectEdge). To do this we have used Char's spanning tree enumeration algorithm [21j. 
Exhaustive enumeration is feasible only for relatively small graphs. 

Finally note that the search schedules produced by (every variant of) GSST are IMC. This holds 
for the search of both T and G. Indeed, for the search of G to work, the search of T must be IMC. In 
other words, no obvious modification of GSST will produce, for example, an internal, connected, non- 
monotone node-clearing search of G. The question arises: how good is a minimal IMC node clearing 
of G (as compared to, for example, an internal but not monotone / connected node clearing)? This is 



the question already hinted at in Remark 4.9 



Let us close this section by repeating that the basic idea of GSST is to perform all clearing moves 
along the edges of a spanning tree. This idea exploits the facts that (a) spanning trees can be both 
generated and searched quickly and (b) blocking recontamination does not require an excessively large 
number of guards (because a searcher can change roles as a guard and a tree searcher). 



5.3 Completeness 

Definition 5.3. Given a graph G and a search S of G, the frontier at t (under S) is 

(t) = {u:u£ Fjv (t) o-nd 3v :v £Vj^ (t) ,uv £ E} , 

^This is true even of GSST-L, because an edge which would be next in Barriere's traversal order may be temporarily 
skipped if its traversal (at the current stage of the search) would cause recontamination. 
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i.e., the n-clear nodes which are connected to n-dirty nodes. 

Lemma 5.4. In a node search, for every t, the nodes u € (t) are guarded. 

Proof. A frontier node u is by definition n-clear and adjacent to an n-dirty node v. This is only possible 
if u is guarded. □ 

Definition 5.5. Consider a rooted IMC node clearing search S of G. let to = and suppose the 
clearing moves ofS take place at times ti, ...,t]\f ; let also to = 0. The n-th phase o/S (for n = 1,2, ...,N) 
is the time interval [tn-i + Ijtn]; i-s. the interval between the (n — l)-th and n-th clearing move. 

The following remarks are rather obvious. For m = 2,3, a target edge UiUm (with i G [l,m — 1]) 
corresponds to the m-th phase (here we take the root node to be ui). While t G [tm-i + l^^m] the 
algorithm moves a searcher towards uiUm- For t G [tm-i + Ij^m ~ 1]) UiUm is n-dirty, Ui is n-clear, Um 
is n-dirty. At t = 1^ we have S (t^) = Um and UiUm, Um are n-cleared. 

Lemma 5.6. Given a graph G = (V, E) and a rooted IMC node clearing search S of G, produced by 
either CSST-R or CSST-LW, let ti, t-2, ... , he the times at which clearing moves take place; let 
also to = 0. Then for n = 2, A'' : 

1. for t G [tn-l,tn - 1].- {t) = (tn-l); 

2. for t G [tn-i,tn — 1]: every u G (t) contains exactly one searcher, except one node u (t) which 
possibly contains two searchers; 

3. fort = t every u G ipn) contains exactly one searcher. 

Proof. The proof is by induction on n. Items 1, 2, 3 of the theorem hold for n = 1, t G [io) = {0, !}• 
Suppose they also hold up to the m-th phase. In the interval [tm + 1, tm+i] a target edge UiUm+i (with 
i G [1, m]) is selected and an available searcher is sent towards UiUm+i- Because GSST-R / GSST-LW 
always selects for clearing an edge adjacent to the clear graph, Ui G (tm), Um+i € Vj^itm). There 

are three cases. 

I. Ui is neighbor of a single n-dirty node, namely Um+i. In this case tm+i = + 1 (i-e., Um+i is 
n-cleared in one step) and [tm,tm+i] = {tm-,tm + !}• Node Ui ^ Vj^ (tm+i)', node Um+i niay or may 
not belong to Vj^ (im+i) but, at any rate, it contains exactly one searcher; no searchers enter or exit 
any other nodes, hence (by the inductive hypothesis) all u G (tm+i) contain exactly one searcher. 

II. Ui is neighbor of more than one n-dirty nodes, one of which is ttm+ii and there are free searchers. 
Since Ui is a frontier node, by hypothesis it contains a single searcher who, consequently, is stuck. 
However, if free searchers are available inside n-clear, non-frontier nodes, one of these searchers will 
be sent to mum+i by a sequence of moves. At every t during this sequence, the searcher may enter a 
frontier node w; for that particular t, w will be the only frontier node which contains two searchers. At 
t = tm+i — 1 the searcher will be located at Ui (which will now contain two searchers) and at trn+i he 
will enter -Um+i; this leaves at tm+i every u G (tm) with a single searcher and also places a searcher 
at Um+i. The only node which may have been added to the frontier is Um+i which contains a single 
searcher; every other node u G {t + 1) was previously in the frontier, contained a single searcher 
and, if a searcher entered u at some t e [tm + 1, im+i — 1]; it exited u at t + 1; hence every such node 
at tm+i contains a single searcher. 

III. The final case is when all frontier-located searchers are stuck and there are no searchers inside 
n-clear, non-frontier nodes. In this case a new searcher is placed at the root node and the rest of the 
analysis is identical to that of Case II. Hence items 1, 2, 3 also hold for [tm + l,im+i] and we can 
complete the induction for n = 1, 2, A?^. □ 
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Lemma 5.7. Given a graph G = {V, E) and a tree sequence (Tg, Ti, T^r) which satisfies conditions 
DI-D4 of Theorem 5.1 Then uniform GSST-R / GSST-LW with M = 1 (i.e., using a single spanning 
tree) has a nonzero probability of producing a search S which generates (To,Ti, ...jTjv). 

Proof. The probability that Algorithm 1 generates the tree sequence (To,Ti, ...,T7v) is 

N 



Pr(To,Ti,..,T;v) 



JJPr (T„|Tiv,To,...,T„_i) 



n=l 



Pr (To|T,v)Pr(T,v) 



Note that the conditioning in the above expression always includes T^, since this is the first choice 
made in running GSST-R / GSST-LW. Now obviously, Pr(To|TAr) = 1. By Wilson's Theorem 1 
[37], Pr(TAr) > for every spanning tree Tn- Also, Pr (T„|T^, Tq, Ti, T„_i) is the probability 
of expanding (at the n-th step) T„_i by the edge UiUn G E„ — E„_i which, by the construction of 
both GSST-R and GSST-LW, is always positive. Finally, Pr (TAr|T^, Tq, Ti, T^_i) = 1. Hence 
Pr (To,Ti, ...,Tn) > for every sequence Ti, ...,Tn. □ 

Lemma 5.8. Given a graph G = {V, E) and a rooted IMG node clearing search S o/G; let (Tq, Ti, Ttv) 
be the tree sequence generated by S. Let S' be a search produced by either GSST-R or GSST-LW and 
also generating {To,Ti, ...,Tj^). Thensn{S) >~sn{S'). 

Proof. The proof is exactly the same for GSST-R and GSST-LW, so we only prove the first one, by 
induction. Let ti, ...,tN be the clearing times of S and t'^, ...,t'j^ be the clearing times of S'. Also let 
to = t'o = 0. 

At to = t'o = we have sn (S', 0) = sn (S, 0) = 0. 

The only times at which sn (S', t) may change are 1, t[ + 1, ...,t']^_^ + 1 . Suppose that 

sn{S,tn) > sn (S',4) . 

Further, suppose that at -|- 1 a new searcher is introduced in S'. This can only happen (in the S' 
search) if all of the following hold: 

1. at exactly |V^(t)| searchers exist in G; 

2. there are no searchers inside nodes u G (t'„) — (t'„) (i.e., all searchers are located inside 
frontier nodes); 

3. all searchers are stuck (i.e., moving a searcher out of a frontier node u exposes u to recontamina- 
tion) . 

The sequence (Tq, Ti, T^v) along with the clearing times determines the frontier (t) for every 
t. Hence S' at has the same frontier as S at i„. If conditions 1-3 above hold in S', then every searcher 
is located in a frontier node and is stuck. It is possible that non-stuck searchers exist in S (located 
either in frontier or non-frontier nodes) but this also means that sn(S,t„) > sn(S',t'^) -\- 1; hence 
adding a searcher in S' at t'^-\- 1 preserves 

sn{S,tn) >sn(S',4 + l). 

Since no searchers are added in S' for t G -|- 2,t^_,_]^] and no searchers are ever removed in S (i.e., 
sn{S,tn+i) > sn{S,tn)) we also get 

sn (S, t„+i) > sn (S', . 
From the above inequality inductively we get sn(S,tN) > sn(S',t'j^) which proves the Lemma. □ 
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Theorem 5.9. Given a graph G = {V, E). 



1. Uniform GSST-R will generate a minimal rooted IMC clearing node search of G with probability 
greater than or equal to 1 — where M is the number of iterations and ai S (0, 1). 

2. Uniform GSST-LW will generate a minimal rooted IMG clearing node search of G with probability 
greater than or equal to 1 — where M is the number of iterations and G (0, 1). 

Proof. The proof is exactly the same for GSST-R and GSST-LW, so we only prove the first one. G 
has at least one minimal rooted IMC node clearing search S of G. Let (To,Ti, ...,Tjv) be the tree 



sequence generated by S. By Lemma 5.7 GSST-R has a nonzero probability, call it Pi, of generating 



in a single iteration a search S' with the same tree sequence as S. Then, by Lemma 5^ 

sn (S) > sn (S') . 

Since S is minimal, sn(S) = sn(S') and so S' is minimal too. Now, the probability of not generating 
S' in a single iteration is qi = 1 — and the probability of not generating S' in M iterations is 
(1 — while the probability of generating S' in M iterations is 1 — af^. □ 



Finally, let us mention that the GSST algorithm can be modified to produce an edge- rather than 
node-clearing search using the following theorem. 



Theorem 5.10. Given a graph G and an IMG node clearing search S using K searchers, there is an 
edge clearing search S' using either K or K + 1 searchers. 

Proof. Suppose that G contains Lq edges, that the length (i.e., number of moves) of S is tjin and that 
sn(S) = K. The new search S' will consist of the S moves (executed at integer times t = 1,2, tjin) 
combined with the moves of an extra searcher, the "edge cleaner", who will only (if at all) move at 
fractional time steps of the form t+ ^^^1^, I = 1,2,..., 3Lq (fractional times are introduced to preserve 
the "alignment" of S and S', i.e. to ensure S (t) = S' (t) at integer times; of course, once S' has been 
obtained, the time scale can be renormalized, so that all moves occur at integer times). We will use 
the notation t~ = t — g^^^-^ . 

We will describe the moves of of the edge cleaner on a step-by-step basis, for t = 1,2, ...,t fin in 
such a manner that at the same time we will complete an inductive proof of the fact that 

fort = l,2,...,t^,„+l : vS {t-l) = vE it-l), {t - 1) = Vi {t - 1) , {t-) = (t-) ; 

recall that is the frontier, i.e., the set of n-clear nodes connected to n-dirty nodes (and similarly 
for V£) and that the frontier nodes are always guarded in both the node and edge game. 

For t = 1 we have (0) = (0) = 0, (0) = V£ (0) = 0, and the edge cleaner is not used, so 
E^ (1") = E^ (0) = E^ (0) = E^ (1-). Now suppose that 

for s = l,2,...,t vS{t-l) = V^{t-l), V^{t-l) = Vi{t-l), E^ {t-) = E^ {t-) (31) 

and consider s = t + 1. Let the (t -|- l)-th move of S be u ^ u . We consider three cases. 

1. Case I. If u is an interior node (i.e., a non-frontier node: u G V^' {t — 1) — {t — 1) = 
{t — 1) — V£ {t — 1) ), then uv is an edge of G^ (t) = G^ (t) and no new nodes/edges are 
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cleared, either in the node or edge game. No path becomes unguarded, and so no recontaminaiton 
is possible either. Hence 

Vn it) = vS{t-l) = VE (t - 1) = VE it) , (32) 

{t) = v^{t-i) = vE{t-i) = vE it) . 

The edge cleaner is not used, hence also 

E§ {it + 1)-) = E§ {t~) = E§ (t-) = E§ {{t + 1)-) . (33) 

2. Case II. If u is a frontier node (u G (t — 1) = VE (t — 1) ) which contains a single searcher 
at t — 1, then u becomes unguarded at t. Clearly 

vEit) = vEit-l)U{v}. (34) 

It is also easy to see that there is no edge uy with y e Vff (* — !) = ^E ~ ^) ^^'^ U ¥^ if such 
an edge existed, then we would have y G VE it) (y was not entered at t) and so n G VE it)', but 
u G VE (^ — 1) (it was guarded at t — 1) and so node monotonicity of S would be violated. 

Take any edge pq G (t~) = E^ it~)- Edge pq cannot be recontaminated in the node game 
(no node was recontaminated). Edge pq cannot be recontaminated in the edge game either; for 
this to happen there must exist an e- unguarded path from pq to some xz G E^ it~)', but all such 
paths must go through u (no other node became unguarded at t) and hence must include uv (the 
only edge e-dirty at t~ and incident on u) but v is guarded at t. Hence no edge is e-dirtied at t 
and so no node is e-dirtied either. In other words 

VEit) = VEit-l)U{v}. (35) 



which, together with (34) shows that VE it) = VE it). Also, VE it) = VE it); in both the node 



and edge game, u was removed from the frontier and v was possibly added to it. In short, we 



have established (32) for Case II as well. 

From the previous remarks we know that, in the edge game and at time t, no edge was recon- 
taminated and edge uv was e-cleared. I.e., 

E'i (t) = E'^ {t-) U {vu} . 

In the node game, on the other hand, 

E^ (t) = E% (t-) U {vxo, vxi, ...,vxl} = E% {{t + 1)") (36) 

where L > 0, xq = u and there may exist other nodes xi, X2, xl which must (a) be neighbors of 
V and (b) belong to VE it — I) ^ VE it — 1). Hence xi, xl were guarded at t — 1 and remain so 
at t. In other words, at time t the edges vxi, ...,vxl have both endpoints guarded and are n-clear 
but e-dirty. Now we invoke the edge cleaner who (at times t + ^E)^i , t + ^Et+i ' • " • ) ™oves to v 
and then performs the moves v — > xi, xi — > — > X2, X2 — > f , ... , x^, — > v. This entire sequence 
can be performed in no more than 3Lq moves, so at t = {t + 1)~ the edges vxq, vxi, vxl have 
been e-cleared and so 

+ = E'^{t-)u{vxo,vxi,...,vxL}. (37) 



Combining (36) and (37) we get E^ ((t + 1) ) = E^ {{t + 1) ). In short, we have established 



(33) for Case II as well. 
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3. Case III. The final case to examine is when n is a frontier node which contains more than one 
searcher at t — 1. We omit a detailed treatment because the proof combines elements from the 
previous two cases; namely, recontamination does not happen (for the same reasons as in Case I) 
bu the edge cleaner may possibly be required (as in Case II). 



Hence, in all three cases considered, starting from (31) we have established (32) and (33). Hence 



we can complete the induction up to time t = tfin + 1 which means 

Vn {tfin) = {tfin) , Epji {{tfin + 1) ) = {{tfin + 1) ) • 



(38) 



Eqs.(38) imply that all nodes are e-cleared at tfin but a few extra steps may be required to e-clear all 
edges (by time {tfin + 1)~ at most). □ 



6 Graph Search Experiments 

In this section we evaluate the performance of the GSST algorithm by numerical experiments. Some of 



these experiments involve specific graphs (Section 6.1) and others involve families of graphs (Sections 



6.2 and 6.3 - in which case we present average results). We use ten variants of GSST, obtained by 
using two different methods of spanning tree generation (uniform and DFS) and five methods of edge 
traversal (GSST-L, GSST-R, GSST-LR, GSST-LW, and GSST-LD). 

6.1 Experiments using Individual Graphs 
6.1.1 Simple Graph 

The first graph we have used appears in Figj2j This is a relatively simple graph (with IMC node search 
number s^*^ =3) which we use to illustrate the basic principles of GSST operation. 



<5> 



Figure 2: A simple graph. 

We node-search the graph using the ten GSST variants, each with M = 5000 generated spanning 
trees; to each spanning tree corresponds a search S^, m = 1,2, ...,5000. In Table 2 we list for each 
GSST variant: (a) minm=i,2,...,Af (Sm) (i-e., the minimum number of searchers achieved by the 
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specific combination), (b) the proportion of minimal searclies out of the the M total searches and (c) 
the tima^ (in sec) required to run the M searches. 





Uniform ST generation 


DFS ST generation 


Edge Traversal 


Min 


Prop, of min 


Time 


Min 


Prop, of min 


Time 


GSST-L 


3 


0.1562 


0.203125 


3 


0.1804 


0.203125 


GSST-R 


3 


0.2031 


0.171875 


3 


0.2848 


0.203125 


GSST-LR 


3 


0.2656 


0.265625 


3 


0.2096 


0.265625 


GSST-LW 


3 


0.2292 


0.218750 


3 


0.2854 


0.250000 


GSST-LD 


3 


0.2031 


0.218750 


3 


0.3020 


0.234375 



Table 2. Node-clearing the "simple graph" by the various GSST variants: minimum number of 
searchers attained and proportion of minimal solutions; number of spanning trees generated is M = 
5 • 10^. 



The true node search number (i.e., 3) has been found by every variant of GSST. Generally, the DFS 
variants perform better than the uniform ones, as can be seen by the higher proportion of minimal 
solutions achieved. A better understanding of the distribution of the number of searchers required by 
each search can be obtained by looking at the histogram of the distribution; one such histogram (for 
the variant with BH edge traversal and uniform spanning tree generation) is plotted in Figjsj We can 
see that this simple graph has a high proportion of spanning trees which yield minimal schedules. 




Figure 3: The histogram of the distribution of minimum number of searchers required to node clear 
the "simple graph" by the uniform GSST-L variant. 

An additional tool to evaluate the anytime performance of GSST is the plot of R (m) = minj<m sn (Sj) 
(the minimum node clearing number of searchers achieved by the first m searches). R (m) is decreasing 
with m. The overall minimum achieved by GSST is R (M) (having tried M searches). If this minimum 
is achieved for a small value of m, then the minimal solution has been obtained quickly. A graph of 
R{m) (for the variant with labeled edge traversal and uniform spanning tree generation) appears in 
Figj4j as can be seen a minimal solution (clearing the graph with three searchers) is achieved by the 
seventh computed search, approximately at time t = 0.203125 • 7/5000 = 2. 843 8 x 10~^ sec. 

The "simple graph" is simple enough to have a relatively small number of spanning trees (namely 
272, as computed by KirchofF's theorem |18j ) and so we can also run GSST on all spanning trees 

^All computations were performed by the gsearch.exe program (see Appendix [d]), on a PC with Intel Dual Core E7500 
CPU, running at 2.93 GHz with 3 GB RAM; computation time is reported in seconds, unless otherwise indicated. 
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Number of spanning trees generated 



Figure 4: Plot of R{m) vs. m for the "simple graph" and the uniform GSST-L variant. The m axis is 
truncated at m = 20, since the rest of the plot shows no change. 

(exhaustive enumeration). The computation takes 0.03125 (for the GSST-L variant) sec and shows 
that 83 out of the 272 spanning trees {rooted at node 1) yield minimal searche^ 

6.1.2 Tree/Grid Graph 

The second graph we have used appears in FigjS} It consists of a "root" node and two branches 
under it; the left branch is a tree and the right one a grid; hence the name "tree/grid". The graph 
has sj^'^ =4 and we have deliberately designed it to "trick" the GSST algorithm. For the sake of 
definiteness consider GSST-L. If the root of the search is node 6, then GSST-L will find a four searcher 
node clearing IMC schedule. However, if the root is node 1, then GSST-L will only find a five searcher 
schedule, even after enumerating all spanning trees. A four searcher IMC node clearing schedule is 
possible from either starting node; but it requires the use of a non-Barriere edge traversal (for example, 
one produced by the GSST-R variant); but GSST-L will always first send the searchers down the right 
branch (towards the grid); actually, going first to the left branch, towards the tree, is better (i.e. yields 
a four searcher schedule) . 

Table 3 summarizes the results of our experiment, using the ten variants of GSST and M = 5 • 10^ 
spanning trees per variant. Note that in this table (as in all others) the starting node is selected 
randomly. The uniform variants are able to generate minimal search schedules, but not the DFS ones. 





Uniform ST generation 


DFS ST generation 


Edge Traversal 


Min 


Prop, of min 


Time 


Min 


Prop, of min 


Time 


GSST-L 


4 


0.00054 


5.921875 


5 


0.06380 


6.687500 


GSST-R 


4 


0.00328 


3.734375 


5 


0.06956 


4.390625 


GSST-LR 


4 


0.00076 


8.328125 


5 


0.06976 


8.765625 


GSST-LW 


4 


0.00206 


4.812500 


5 


0.07252 


5.609375 


GSST-LD 


4 


0.00232 


4.968750 


5 


0.09376 


5.656250 



Table 3. Node-clearing the "tree/grid" by the various GSST variants: minimum number of searchers 
attained and proportion of minimal solutions; number of spanning trees generated is M = 5 • 10^. 

*Note that all GSST variants, except GSST-L, use a randomized element in the order of edge traversal - hence multiple 
runs with the same spanning tree can yield diflterent searches. 
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Figure 5: The "tree/grid" graph. 

In this case the uniform variants find four-searcher clearing schedules and hence outperform the 
DFS generated ones, which can only clear the graph with five or more searcher^ Note also the lower 
proportion of minimal solutions, even for the uniform variants. For example, uniform GSST-L finds 
0.00054 • 50000 = 27 minimal solutions. This in an indication that this graph is indeed harder than 
the "simple" one, at least for the GSST algorithm. Also, the tree/grid graph has 31529 spanning trees 
(and correspondingly many cycles) which is an additional indication that it is (much) harder to search 
than the simple graph, which has 272 spanning trees. The small number of minimal solutions can also 
be appreciated by looking at the histogram (for uniform GSST-L variant it is plotted in Figj6|. 




Figure 6: The histogram of the distribution of minimum number of searchers required to node clear 
the "hard graph" by the uniform GSST-L variant. There is a small, barely noticeable, bar at /c = 4, 
corresponding to the four-searcher clearing schedules. 



However, despite the small proportion of minimal solutions, the first one is always found after a 
relatively small number of iterations. For example, the uniform GSST-L variant finds a four searcher 
schedule with the 420-th spanning tree generated (out of a total of 5 • lO'* spanning trees) , approximately 
at time t = 5.921875 • 420/50000 = 4.974 4 x lO'^ sec. 



^We have experimented with larger values of M (e.g., M — 10^) but the DFS variants still cannot achieve a four 
searcher clearing schedule. 
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6.1.3 NSH Graph 



The next graph we use has been obtained by discretization of an actual floorplan, namely the first 
floor of the Newel-Simon building in the Carnegie Mellon University campus. In FigjTjwe present the 
actual floorplan and its discretization; in Figjs] we present the resulting graph (the node numbers in 
FigJS^ correspond to the cell numbers in FigJ?]). 




Figure 7: The NSH floorplan. 




Figure 8: The NSH graph. 
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This graph, with 60 nodes and 64 edges, appears to be more complex than the previous ones; 
however it has a relatively small number of spanning trees (namely 3604, as computed by KirchofF's 
theorem). By visual inspection it appears very unlikely that the graph can be node-cleared with two 
searchers; on the other hand, as will be seen presently, GSST can compute three-searcher clearing 
schedules. Hence we conjecture that the search number s^^^ is three. 

Applying the "standard" variants of GSST and using M = 5 • lO'' spanning trees per variant we 
obtain the results of Table 4. 





Uniform ST generation 


DFS ST generation 


Edge Traversal 


Min 


Prop, of min 


Time 


Min 


Prop, of min 


Time 


GSST-L 


3 


0.00224 


14.265625 


3 


0.01038 


14.578125 


GSST-R 


3 


0.00002 


9.406250 


3 


0.00006 


10.140625 


GSST-LR 


3 


0.00478 


21.937500 


3 


0.01026 


21.390625 


GSST-LW 


3 


0.00328 


11.546875 


3 


0.00608 


12.593750 


GSST-LD 


3 


0.00544 


12.625000 


3 


0.00880 


13.578125 



Table 4. Node-clearing the NSH graph by the various GSST variants: minimum number of searchers 
attained and proportion of minimal solutions; number of spanning trees generated is M = 5 • 10^. 



We see that all the GSST variants using uniform spanning tree generation achieve the true search 
number, namely three. The random edge traversal variants perform poorest of all: the uniform GSST-R 
finds only one minimal search and the DFS GSST-R variant three. Generally, while the time required 
to complete M searches is higher for the NSH graph than for the "tree/grid" graph, the proportion 
of correct solutions is in some cases higher (at least for the uniform variants). A histogram of the 
searchers required appears in Figj9] for the uniform GSST-L variant; for this variant the first minimal 
search schedule is computed at step 845 (out of 50000), approximately at time 14.265625 -845/50000 = 
0.24109 sec. 




Figure 9: The histogram of the distribution of minimum number of searchers required to node clear 
the NSH graph by the uniform GSST-L variant. 



6.1.4 National Art Gallery Graph 

The next graph we use has also been obtained by discretization of an actual floorplan, namely the first 



floor of the National Gallery of Art, in Washington, DC. In Fig, 10 we present the actual floorplan and 



the discretization we have used; in Fig, 11 we plot the corresponding graph 
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Figure 10: The National Gallery of Art floorplan. 




This graph, has 70 nodes and 93 edges and is more complex than any of the previously used graphs. 
In particular, it is more complex than the NSH graph. While the National Art Gallery graph has 
about 50% more edges than the NSH graph, it has a much larger number of spanning, approximately 
5.3-10^^, as compared to 3604; this precludes use of the GSST with exhaustive enumeration of the 
spanning trees. Also, we have no obvious way to compute the true node search number. Nevertheless, 
we proceed to apply the ten GSST variants using M = 1.5 • 10^ spanning trees per combination. The 
results obtained appear in Table 5. 





Uniform ST generation 


DFS ST generation 


Edge Traversal 


Min 


Prop, of min 


Time 


Min 


Prop, of min 


Time 


GSST-L 


5 


0.0000066 


74.378237 


5 


0.0004466 


77.214947 


GSST-R 


6 


0.0013133 


45.500000 


5 


0.0004800 


50.984375 


GSST-LR 


5 


0.0000066 


78.172631 


5 


0.0002000 


87.250000 


GSST-LW 


5 


0.0000200 


62.859375 


5 


0.0007600 


77.500000 


GSST-LD 


5 


0.0000066 


64.656250 


5 


0.0009800 


77.546875 



Table 5. Node-clearing the National Art Gallery graph by the various GSST variants: minimum 
number of searchers attained and proportion of minimal solutions; number of spanning trees generated 
is M = 1.5 • 10^ 



We see that all the GSST variants 
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find node clearing searches with five searchers. While we 
cannot be sure that the true node search number of the National Art Gallery is five, we have been 
unable to find a lower search number using any method (including extensive inspection by the authors). 
In addition, the middle part of the graph resembles a five-by-six grid, which is known to have a node 



search number of five (see Section 6.3). A histogram of the searchers required appears in 12 for the 
uniform GSST-L variant; for this variant the first minimal search schedule is found at step 101823 (out 
of a total 1.5 • 10^1. Similar results hold for the other variants. 



12 3 4 5 




No 01 searchers 



Figure 12: The histogram of the distribution of minimum number of searchers required to node clear 
the National Gallery of Art graph by the uniform GSST-L variant. 

The National Art Gallery graph results (as well as the NSH ones) show the applicability of GSST 
on graphs that are derived from representations of real indoor environments (which has been the main 
motivation for our research). The methods that incorporate Barriere labeling improve performance on 
the NSH map because it is similar to a tree (it becomes a tree if we remove only a few edges). In this 
case, Barriere labeling, which is based on trees, helps to improve the schedules generated by GSST. 

'^"Except for uniform GSST-R; however 5-searcher clearing schedules can also be found by this variant if it is run for a 
sufficiently large M. 
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The National Art Gallery, on the other hand, is more similar to a grid, which lessens the advantage of 
using Barriere labeling to guide traversal (though a significant improvement is still obtained) . 



6.2 Interval Graphs 



The next experiment presented involves, unlike the ones of Section 6.1 a large number of graphs 
belonging to the same family. Our goal is to evaluate the average performance of the GSST algorithm. 
While, strictly speaking, the results are specific to the family of interval graphs, they also suggest 
general properties of the GSST algorithm. 

To evaluate the performance of GSST we must have some estimate of the actual search number 
of each graph we use; then we can compare this search number with the minimum sn (S) achieved by 
GSST. To satisfy this condition, we will work with interval graphs. Briefly, an interval graph is obtained 
from a system of intervals, i.e. a collection of intervals of real numbers; each interval corresponds to 
a graph node and two nodes are connected by an edge iff the corresponding intervals intersect. It is 
known |10|[22] that the (non-monotone, non- connected) edge search number of an interval graph G (i.e., 
se (G)) is equal to its interval width, which is defined to be the size of the largest clique of the graph. 
It is also known that the interval width of an interval graph G of nodes can be computed in time 
0{N) |10| |22] . Of course, while we know the edge search number se (G) exactly, this only provides an 
estimate of the IMC node-search number s^'^ (G), in which we are really interested. There is no strict 
inequality connecting se (G) and s]^'^ (G); we just know that sn (G) < se (G) and sn (G) < s^'^ (G). 
In general, we expect that the discrepancy between se{G) and s^'^(G) is not too large and hence 
se (G) can be used to evaluate GSST performance. 

We randomly generate interval systems (and the corresponding interval graphs) by the following 
mechanism. First we select two parameters of the family: A^, the number of intervals, and A, the 
average interval length. Then we generate A^ intervals, the n-th interval having its left endpoint at n 
and the right endpoint at n + 5, where 5 follows an exponential probability law f {5) ~ e^^l^. We 
form the interval graph G corresponding to this interval system and compute its interval width. We 
repeat the process 100 times to obtain 100 interval graphs; these form a family characterized by the 
parameters A, A (specified by us) and also by the average number of edges and the average interval 
width. We repeat the process for five different choices of (A, A); this information is summarized in 
Table 6. 



A 


A 


Average num. of Edges 


Average Interval Width 


30 


3 


74.22 


5.49 


30 


5 


113.06 


7.66 


25 


7 


182.01 


9.51 


35 


10 


231.84 


11.68 


40 


15 


366.79 


15.53 



Table 6. Characteristics of the various families of interval graphs used for the experiments. 



Finally, we apply the ten GSST variants to each of the five familes (using M = 2 • 10"^ spanning 
trees per variant). The results are summarized in Tables 7. a (for the uniform variants) and 7.b (for the 
DFS variants). In these tables every row corresponds to a family, the families being indexed by their 
average interval widths (appearing in the first column). Each of the remaining columns corresponds to 
one GSST variant and lists the average (over the 100 graphs) minimum number of searchers achieved 
by the respective variant. Hence, an estimate of the efficiency of the GSST variants can be obtained by 
comparing the first column of each table to the remaining ones. For example, in the first row of Table 
7. a we see that all uniform GSST variants node clear the graphs of the first family with fewer searchers 
(on the average) than those "predicted" by the interval width of the graphs (e.g., 5.15 < 5.49). As 
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we proceed down the rows of Table 7. a to graphs of higher interval width (and, presumably, of greater 
complexity) the average number of searchers required to clear a graph increases above the average 
interval width. For example, in the last row the average interval width is 15.53 and the average 
minimum number of searchers required by the uniform GSST-L variant is 19.05; in other words the 
uniform GSST-L requires ^^'^i^^^'^"^ = 22.66% more searchers than expected by the interval width 
estimate. Things get better with the DFS variants; for example, for the (A^ = 40, A = 15) family, the 
searcher overhead incurred by the DFS GSST-L variant is ^"^'^i^^^'^"^ = 13.84%, which is actually quite 
good for graphs of such high complexity. 



Av. Int. Width 


GSST-L 


GSST-R 


GSST-LR 


GSST-LW 


GSST-LD 


5.49 


5.15 


5.32 


5.36 


5.14 


5.16 


7.66 


7.82 


8.08 


8.03 


7.80 


7.87 


9.51 


11.01 


11.42 


11.45 


11.02 


11.30 


11.68 


13.61 


13.91 


13.91 


13.60 


13.63 


15.53 


19.05 


19.37 


19.50 


19.06 


19.25 



Table 7. a: Average minimum number of searchers required to node-clear each family of interval graphs 
by the uniform GSST variants; number of spanning trees generated is M = 2 • 10^. 



Av. Int. Width 


GSST-L 


GSST-R 


GSST-LR 


GSST-LW 


GSST-LD 


5.49 


5.04 


5.13 


5.04 


5.08 


5.06 


7.66 


7.34 


7.47 


7.39 


7.44 


7.36 


9.51 


10.23 


10.34 


10.25 


10.25 


10.24 


11.68 


12.70 


12.80 


12.64 


12.73 


12.62 


15.53 


17.68 


17.87 


17.75 


17.72 


17.66 



Table 7.b: Average minimum number of searchers required to node-clear each family of interval graphs 
by the DFS GSST variants; number of spanning trees generated is M = 2 • 10^. 



The results on interval graphs show that GSST can yield near-minimal schedules on a large class of 
graphs. They also demonstrate that GSST scales well with increasing complexity both in performance 
and computation time. This is a direct result of the linear scalability of the algorithm in the number 
of nodes in the environment. 



6.3 Grid Graphs 
6.3.1 Full Grids 

We now present some experiments involving grid graphs, i.e. graphs with nodes located at points with 



integer coordinates (an example appears in Fig. 13). It is easily seen that a grid graph of dimension 



Ji X J2 (i.e. containing nodes with coordinates (ji,j2) S {li2,..., Ji} x {1, 2, J2}) can be node 



cleared using Jq = min(Ji, J2) searchers; the corresponding search schedule is obvious However, 
for a general graph search algorithm (such as GSST) which must work without assuming any special 
structure of the graph, grid graphs are potentially extremely hard, because they have many cycles and 
many spanning trees, only a few of which correspond to minimal search schedules. 

In this experiment we use 6 grid graphs, with dimensions starting at 5 x 5 and going up to 10 x 10. 
To each such graph we apply the ten GSST variants; in Tables 8. a (uniform ST generation variants) 
and 8.b (DFS ST generation variants) we list the minimum search number attained by each variant on 

^^Namely (assuming that the height Ji of the grid is less than or equal to the width J2) place Ji searchers on one of 
the vertical sides of the grid and slide them horizontally to the other vertical side. 
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each graph (M, the number of spanning trees generated per graph is also hsted, in the last column). 
Let us also note that the total execution time for this experiment is approximately 25 hours, reflecting 
(a) the large number of graphs used, (b) the high complexity of many graphs, (c) the large number of 
spanning trees used (the m,uch smaller execution time for each individual graph and GSST variant is 
not listed, for economy of space). 



Uniform random spanning tree generation 


Graph Dim. 


GSST-L 


GSST-LR 


GSST-R 


GSST-LW 


GSST-LD 


No. Trees 


5x5 


5 


5 


5 


5 


5 


10^ 


6x6 


7 


7 


7 


7 


7 


2-105 


7x7 


8 


8 


8 


8 


8 


3-105 


8x8 


10 


9 


9 


9 


9 


3-105 


9x9 


11 


11 


11 


10 


11 


4-105 


10x10 


13 


13 


13 


13 


13 


5-105 



Table 8.a. Minimum search number attained by the uniform GSST variants on the full-grid graphs. 



DFS random spanning tree generation 


Graph Dim. 


GSST-L 


GSST-LR 


GSST-R 


GSST-LW 


GSST-LD 


No. Trees 


5x5 


6 


6 


6 


6 


6 


105 


6x6 


7 


7 


8 


7 


7 


2-105 


7x7 


9 


9 


9 


9 


9 


3-105 


8x8 


11 


11 


11 


11 


11 


3-105 


9x9 


12 


13 


12 


13 


12 


4-105 


10x10 


14 


14 


14 


14 


15 


5-105 



Table 8.b. Minimum search number attained by the DFS GSST variants on the full-grid graphs. 



We see that, on the grids, the uniform variants perform better than the DFS ones; this is exactly 
the opposite situation from what happens in interval graphs. With few exceptions, the uniform variants 
(Table 8. a) either attain the actual minimum search number (for the 5x5 graph) or incur a small 
overhead of 1 or 2 extra searchers. The situation changes with the 10 x 10 graph, where the minimum 
attained search number is 13, with a 30% overhead over the true s^*^ = 10. However, the 10 x 10 
grid graph is very complex, with 100 nodes, 180 edges and 5.6943-10^^ spanning trees. Judging from 
the distribution of the attained search numbers (which, for economy of space, is not displayed here) 
lower search numbers can be attained but they require a much larger number of spanning trees. To 
get an idea of the complexity of the full grid graphs, we list in Table 8.c. the number of spanning trees 
for each of the six graphs used. This provides a measure of the hardness of node-clearing the graph. 
This actually quite reasonable: a graph with many cycles (and many spanning trees) is harder to clear 
because it contains many escape routes for the evader. 



Graph Dim. 


No. of spaning trees 


5x5 


5.57560-10^ 


6x6 


3.2566-10^3 


7x7 


1.9872-1019 


8x8 


1.2623-10^*^ 


9x9 


8.3266-10^^ 


10x10 


5.6943-10^^ 



Table 8.c. Number of spanning trees for each of the full-grid graphs 
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On the positive side, we see that node-clearing schedules can be computed for the full grid graphs 
using slightly more than the minimum number of searchers and in reasonable computation time. For 
example, the uniform GSST-L variant finds the first 8-searcher clearing schedule for the 7x7 graph in 
under 2 mins and an 11-searcher clearing schedule for the 9x9 graph in under 10 mins. Considering 
that (due to the large number of cycles) grid graphs are potentially some of the hardest graphs for 



GSST we find these results to be quite satisfactory. 



6.3.2 Depleted Grids 

Our final experiment involves what we call "depleted grid graphs". These are obtained as follows. 
First we choose two parameters, Ji and J2, the length and width of the grid. Just like with the full 
grid graphs, we place nodes at the positions (ji, J2) £ {l, 2, Ji} x {1, 2, J2}. We also connect all 
nearest neighbor nodes along the following lines 

1. one horizontal line: {(ii, I)}ji6{i,2,...,j2}> 

2. Ji vertical lines: {(ii, j2)}j2e{i,2,...,J2}' 3i = 1,2,..., Ji, 



obtaining a tree of the form indicated by the solid edges in Fig, 13 Finally, we consider all pairs 
of nearest neighbors (^1,^2), (ji, ^2) which are not already connected and add to the graph an edge 
connecting each such pair with probability p. 



Figure 13: Generating a depleted grid graph. The solid lines correspond to edges which are always 
present in the graph. Each dotted line becomes and edge with probability p. 

The resulting graph G is a subgraph of the full Ji x J2 grid and has s^'^ (G) < min (Ji, J2); in 
other words, min(Ji, J2) yields an upper bound (useful for estimating the performance of GSST) of 
s^ff^^ (G). It seems reasonable that, on the average, sjp'^ (G) is an increasing function of p and for p = 1 
the bound is tight, i.e. s^^^ (G) = min ( Ji, J2); it will be useful to keep this in mind when evaluating 
the results of the experiment. Note also that, as p approaches 0, the graph becomes more similar to a 
tree (has fewer cycles). 

We generate twelve families of depleted grids; namely we use dimensions 5 x 5, 7 x 7, 8 x 8, 10 x 10 
and p values 0.4375, 0.7500 and 0.9375, yielding twelve combinations. We generate fifty graphs from 
each family and apply to each of these the ten GSST variants, using 2-10^ spanning trees for each 
graph. The results appear in Tables 9. a (uniform variants) and 9.b (DFS variants). Namely, for each 

^^It is interesting to note that a somewhat related fact is weh known in the probabihstic inference literature: inference 
is easier on graphs with low tree- width [?]• 
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combination we present the minimum search number attained, averaged over the fifty graphs of the 
corresponding family. It can be seen that the resulting numbers are quite low, especially for the uniform 
variants, often going under the min(Ji, J2) bound. 



Graph Dim. 


P 


GSST-L 


GSST-R 


GSST-LR 


GSST-LW 


GSST-LD 


5x5 


0.4375 


2. 86 


2.86 


2.86 


2.86 


2.86 


5x5 


0.7500 


3.68 


3.68 


3.50 


3.48 


3.48 


5x5 


0.9375 


4.53 


4.53 


4.12 


4.18 


4.20 


7x7 


0.4375 


3.63 


3.56 


3.64 


3.57 


3.62 


7x7 


0.7500 


5.24 


5.14 


5.32 


5.08 


5.06 


7x7 


0.9375 


6.62 


6.40 


6.56 


6.30 


6.28 


8x8 


0.4375 


4.22 


4.14 


4.08 


4.08 


4.00 


8x8 


0.7500 


6.10 


6.14 


5.90 


5.86 


5.84 


8x8 


0.9375 


7.70 


7.76 


7.52 


7.40 


7.46 


10x10 


0.4375 


5.44 


5.50 


5.32 


5.30 


5.28 


10x10 


0.7500 


7.94 


8.02 


7.84 


7.68 


7.80 


10x10 


0.9375 


10.20 


10.40 


10.24 


10.06 


10.94 



Table 9. a. Average minimum number of searchers required to node-clear each family of depleted-grid 
graphs by the uniform GSST variants; number of spanning trees generated is M = 2 • 10^. 



Graph Dim. 


P 


GSST-L 


GSST-R 


GSST-LR 


GSST-LW 


GSST-LD 


5x5 


0.4375 


3.36 


3.36 


3.36 


3.36 


3.36 


5x5 


0.7500 


4.38 


4.36 


4.36 


4.36 


4.36 


5x5 


0.9375 


5.04 


5.04 


5.02 


5.04 


4.98 


7x7 


0.4375 


4.24 


4.24 


4.24 


4.24 


4.24 


7x7 


0.7500 


5.82 


5.78 


5.84 


5.80 


5.77 


7x7 


0.9375 


7.24 


7.30 


7.28 


7.28 


7.32 


8x8 


0.4375 


4.68 


4.66 


4.66 


4.63 


4.64 


8x8 


0.7500 


6.56 


6.62 


6.62 


6.64 


6.62 


8x8 


0.9375 


8.58 


8.52 


8.62 


8.54 


8.56 


10x10 


0.4375 


5.80 


5.80 


5.78 


5.78 


5.80 


10x10 


0.7500 


8.62 


8.58 


8.68 


8.72 


8.62 


10x10 


0.9375 


11.34 


11.38 


11.54 


11.38 


11.26 



Table 9. b. Average minimum number of searchers required to node-clear each family of depleted-grid 
graphs by the DFS GSST variants; number of spanning trees generated is M = 2 • 10^. 

The depleted grid results show the high performance of GSST on graphs with large numbers of 
spanning trees and many cycles. Better results are obtained for smaller values of p, i.e. when the graph 
becomes more similar to a tree; this is to be expected, since a basic component of the GSST algorithm 
is search along a spanning tree. It is interesting to note that even for a quite high value of p, namely 
0.9375, results (at least for uniform GSST variants) are quite good; taking into account also the results 
of the full grids, it appears that the increase of attained search number over min ( Ji, J2) happens only 
when p gets very close to 1. Once again we observe that the uniform variants perform better than the 
DFS ones. 
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6.4 Discussion 



Our experiments demonstrate the performance of GSST on several different classes of graphs. On many 
complex graphs, GSST was able to find a minimal search schedule in reasonable time. On all graphs, 
at least one near-minimal schedule is computed early in the execution of the algorithm. In fact, the 



"anytime" operation of GSST (see also the discussion in Section 5.1) yields reasonably good solutions 
in very short time and then keeps improving on these solutions as long as additional computation time 
is available. While the minimal schedules form a small percentage of the computed solutions, many 
more near-minimal solutions are available. These properties of GSST are highly desirable, especially 
given the fact that (as far as we are aware) no other algorithm has been implemented which can tackle 
graphs of the size and complexity presented here. 

We can highlight some specific conclusions supported by the experiments on the various graphs 
considered in Section 16.11 - 16.31 



1. The NSH and National Art Gallery graphs show the application of GSST on graphs that are 
derived from representations of indoor environments. The methods that incorporate Barriere 
labeling improve performance on the NSH map because it is similar to a tree. The National 
Art Gallery, on the other hand, is more similar to a grid, which lessens the advantage of using 
Barriere labeling to guide traversal. 

2. The results on interval graphs show that GSST can yield near-minimal schedules on a large class 
of graphs. They also demonstrate that, with increasing complexity, GSST scales well both in 
performance and computation time. 

3. Finally, the grid results show the high performance of GSST on graphs with large numbers of 
spanning trees and many cycles. These are potentially some of the hardest graphs for GSST. 
Also, it appears that the the increase of attained search number over min ( Ji, J2) happens only 
when p gets very close to 1. 

Comparing spanning tree generation methods, we see that DFS generation performs better on all 
graphs considered except for grid graphs (and the tree/grid graph) where the uniform ST generation 
is better. At this point we do not have a good explanation but we intend to further research this fact 
because we believe it will give us a better understanding of which are the "good" spanning trees, i.e. 
the ones associated with minimal searches. See Section [7] for some additional remarks on this issue. 

Regarding edge traversal methods, there is no single GSST variant which consistently outperforms 
all others. However, uniform GSST-L never does too poorly and is fast; GSST-LW has the same 
advantages and, in addition, is provably complete. Hence, in case node-clearing schedules must be 
quickly produced (i.e. computation time is at premium), one can use a "reduced suite" of GSST 
variants, consisting of GSST-L and GSST-LW (with both uniform and DFS spanning tree generation) 
and expect to obtain nearly as good results (at a fraction of the computing time) as when using the 
"full" suite of the ten GSST variants. 

Let us also stress that all variants of GSST can attain lower search numbers than the ones presented 
here if sufficient running time is available. 



7 Conclusion 

Motivated by the problem of robotic pursuit / evasion, in the current paper we have investigated 
node search, i.e., the capture by a team of searchers of an invisible evader located in the nodes of a 
graph. This problem has so far received little attention in the graph theoretic literature, with most 
related publications concentrating on the problem of edge-located evader. The basic contributions of 
the current paper are of two kinds. 
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1. From the theoretical point of view, we have shown that, in general graphs, the problem of node 
search is easier than that of edge search, in the sense that every edge clearing search is also a 
node clearing one; but the converse does not hold in general. We have then concentrated on 
the internal monotone connected (IMC) node search of trees and shown that it is essentially 
equivalent to edge search under the same restrictions; hence Barriere's tree search algorithm, 
originally designed for edge search, can also be used for node search. 

2. From the algorithmic point of view, we have presented GSST, a new algorithm which performs 
IMC search on general graphs. This algorithm is based on the fact that every node clearing 
search generates a spanning tree; hence node-clearing a preselected spanning tree (by the use of 
tree searchers) and simultaneously blocking recontamination (by the use of guards) monotonically 
node-clears the graph. Because spanning tree generation and search can both be performed very 
efficiently, a large number of spanning trees can be tried by GSST until the one yielding the 
minimum number of searchers is discovered. Experiments have shown that GSST can quickly 
node-clear large and complex graphs using a small number of searchers. 

Many issues touched upon in the current paper require additional research. We conclude by listing 
some future research directions, categorized as follows. 

1. Theory. 

(a) Optiniality. Given a graph G with IMC node search number s^ff"^ (G). Suppose we have K 
searchers with K > s^^^{G). What is the best we can do? Of course "best" depends on 
some optimality criterion. For example we may want to clear the graph in the minimum 
number of steps. Or, we may want a search strategy which minimizes the cumulative "area" 
of the dirty set (i.e., X^^i (0)- We would like to obtain algorithms which solve such 
optimization problems. 

(b) Our experiments show that the uniform GSST variants perform better than DFS on grids 
and grid-like graphs. We want to discover a mathematical explanation of this fact, e.g., to 
find necessary and / or sufficient conditions under which uniform GSST outperforms DFS 
GSST. A result of this type will probably have useful computational implications - see also 
item 2. (a) below. 

(c) We want to establish a sequence of inequalities between node search numbers. In other 



words, to refine (30), either for a general graph G, or for the special case when G is a tree. 



For the latter case, we conjecture that we can re-establish the Barriere et al. inequalities 



(28) so as to hold for node- rather than edge-search numbers. 

(d) Finally, we are interested in questions of consistency. Suppose that we are given a graph 
along with a configuration of each nodes (i.e., a specification of the clear and dirty nodes). 
Can this configuration be obtained by a sequence of moves in the node game? In the edge 
game? Also, given a configuration of clear and dirty nodes and another one of clear and 
dirty edges, are these configurations compatible in the node game? In the edge game? 

2. Computation. The GSST algorithm searches the space of spanning trees in a random manner. 
Is there a way to bias the search in some useful way? For this we need a characterization of "good" 
spanning trees, i.e., trees which yield a low number of searchers (this includes tree searchers and 
guards). We have no such characterization at present. Perhaps we can obtain it by understanding 
why uniform variants perform better than DFS on grid graphs (and conversely on interval graphs). 

3. Extensions. We want to study theoretically and develop algorithms for the following variants 
of the search problem. 
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(a) For less restricted types of node search, e.g., internal monotone (but not connected), internal 
connected (but not monotone); or for completely unrestricted node search. 

(b) For different types of pursuer / evader behavior; e.g., for pursuers with extended (non-local) 
visibility, evaders with finite speed etc. 

(c) How does the graph search problem look from the evader's point of view? Given a graph G 
and K searchers, what is the best the evader can do if i^' < s (G)? What if K > s (G)? In 
the latter case capture is guaranteed, but can the evader, for example, maximize the number 
of steps until capture? Many other interesting problems can be posed from the evader's point 
of view. 

Acknov^^ledgement. Wc gratefully acknowledge Sanjiv Singh for many useful discussions and valuable 
comments on the current paper. 
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A Appendix: Edge Search on Trees 



In this Appendix we present the R-Label and R-Search algorithms, which perform rooted IMC search 
of trees and are also used by the GSST algorithm. R-Label and R-Search are simplified versions 
of corresponding algorithms presented by Barriere et al. [2]; some theoretical results from [21 are also 
reviewed. In our presentation the terminology and notation of |;2] are somewhat changed, to conform 
with the current paper. 

The next Lemma and Definition play a crucial role in computing the IMC search number of a tree. 

Lemma A.l. |0 Given a rooted tree T^. and a node y ofTx, ifu has more than one children, enumerate 
them as yi,y2, ■..,yK, so that they satisfy s'^" {T^, [yk]) > (T^; [yk+i]) for k = 1,2, K - 1. Then 

st" (T. [y]) = max (s^ (T. [yi]) , s^" (T. [ya]) + l) • (39) 

Definition A. 2. Given a tree T = {V, E) and a node x £ V, consider all the edges xy incident on 
X and label them as follows. 

1. If y is a leaf, then \x {xy) = 1. 

2. If y is not a leaf, let zi, Z2, zk be the neighbors of y other than x, enumerated so that Xy {yzk) > 
Xy (yzk+i) (for k = 1,2, K — 1) and define 

Ax {xy) = max (Aj^ {yzi) , Xy {yz2) + 1) ■ (40) 



Note that the above definition assigns two labels to each edge xy, namely Xx {xy) and Xy{xy). 
Intuitively, A^; {xy) is the number of searchers that must cross the directed xy edge in an IMC edge- 
clearing of the tree. In |2j Barriere et al. present the Label algorithm which computes the labels 
{Xx {xy)}xy^E- Furthermore they prove the following. 

Lemma A. 3. /^/ For every tree T and every edge xy o/T, we have s^*^ (T2:[y]) = A^^ {xy). 
Lemma A. 4. ^] Given a tree T = {V, E), for every x £ V define fi {x) as follows 



H{x) 



Ax {xyi) if X has a single neighbor yi 

max (Ax {xyi) , Ax {xy2) + 1) otherwise; 



in the second line of the definition, yi,y2,--- are the neighbors of x, enumerated so that they satisfy 
(Tx [yk\) > st" (Tx [Vk+i]) for k = l,2, ... . Then 

sf^c (T^) = ^ {x) and s^^ (T) = min ^ {x) . 

xe V 

The algorithms and results of [2] are geared towards computing s^'^(T). The GSST algorithm, 
on the other hand, requires the computation of s^'^ (Tx) and rooted searches. Hence we introduce 
the R-Label and R-Search algorithms, which are simplified, "rooted" versions of the corresponding 
Barriere algorithms. The pseudocode of these algorithms is listed in the next page. 

The R-Label algorithm takes as input a tree T = {V, E) and a root node uq £ V. The algorithm 



is a straightforward implementation of Definition A.2[ R-Label does not compute all the labels 



{Ax {xy)}xy£E o^^y ones for which y is a child of x in the rooted tree T„q; for these edges, 
R-Label produces the same A labels as Label. R-Label uses two subroutines: given that uq is 
the root, the subroutine Depth(T,uo) returns a partition of the node set V, where V^^'^ 

contains all nodes of depth /; the subroutine SortChildren(2/, T, uq, X) returns the children of y sorted 
in decreasing order of their A labels. 
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The R-Search algorithm uses the A labels to produce a rooted IMC (edge and node) clearing search 
of T. R-Search is almost identical to Barriere's Sesirch, the only difference being that the starting 
node no is given, rather than chosen by the algorithm (this implies that R-Search produces a minimal 
rooted IMC search of T„„). R-Search uses Barriere's subroutine Move. The notation S = S| {u,v, J) 
means: append J moves of the form u ^ v to the search schedule S. 

Algorithm 3 R-Label(T, no) 

Input: Tree T = (V, E), start node uq 
{F«}f=i = Depth(T,no) 
for I = L — 1 with step —1 until 1 do 
for X G V^^") do 

for y G SortChildren(x, T, Uq, A) do 

[zi, Z2, zk] = SortChiIdren(?/, T, uq, A) 
if 'Leng;th.{[zi, Z2, zk])=0 then 

K{xy) = 1 
else if Length([zi, Z2, ...,zk]) = 1 then 

else 

K{xy) = Taayi{Xy{yzi), Xy{yz2) + 1) 
end if 
end for 
end for 
end for 

Output: Edge labeling {Xx{xy)}xyeE 



Algorithm 4 R-Search(T, ;/(), A) 

Input: Tree T = {V,E), start node uq, edge labeling A 
S = 

[yi,y2, ...,yK] = SortChildren(no, T, mq, A) 
for k = K with step —1 until 1 do 

S=Move(x, yk, \j:{xyk),S) 
end for 

Output: Search S 
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Algorithm 5 Move(«, v, J, S) 

Input: Tree T = {V, E), start node uq, edge labeling A 
S = SKu,!;, J) 

[■wi,W2, ■■■,wk] = SortChildren(t;, T, iiQ, A) 
for k = K with step —1 until 1 do 

S=Move{v, Wk,Xy{ywk), S) 
end for 
S = S\{v, u, J) 
Output: Search S 



43 



B Appendix: The relationship between node search and mixed edge 
search 



Theorem 3.3 tells us that edge search is "weaker" than node search in the sense that, for every graph 
and every search schedule, we have V^{t) C {t) and {t) C E^{t). We will now consider a 
variant of edge search, the so-called mixed edge search which, as we will see, is equivalent to node 
search. 

Mixed edge search is obtained by augmenting the clearing rules. In the ^^Mixed Search Edge Game'''' 
an edge can be cleared not only by sliding, but also by guarding both its endpoints. More precisely, to 
obtain the rules of the mixed edge search game, take the rules of the edge game (presented in Section 
[2]), change the terms "e-clear" and "e-dirty" to "m-clear" and "m-dirty" and add the following rule: 

E2' An m-dirty edge uv becomes m-clear when both u and v are occupied by searchers. 

In other words, we can clear an edge either by traversing it or by guarding both its endpoints. 

We will use (t) to denote the set of m-clear nodes at time t, and will analogously use the 
notations Eg (t), (t), E^j (t), G^^ (t) etc. 

We emphasize that, in the mixed search edge game, the evader resides in the edges of the graph 
and recontamination occurs by the same rules as in the "plain" edge game. 

Theorem B.l. Given a graph G and an internal search schedule S. Then, we have 

fort = 0,1,2,...: {t) = V^, {t) and E'^ {t) = Eg (t) . (41) 

Proof. The proof is inductive and consists of several steps. 

I. At t = we have 

^jv (0) = l^M (0) = and E^{0)=Eg{0) = $. (42) 

Suppose that we have 

for s = 0, 1, ...,t : (s) = V^j (s) and Eg (s) = Eg (s) . (43) 

and let us next consider time t -|- 1 . 

II. We will now prove that 



E^{t + l)<ZEg{t + l). (44) 



To this end we will show two things. 



II. a If a previously n-dirty edge becomes n-clear at t + 1, then it also becomes m-clear. Suppose that 
E§ (t) nE^{t + l). In other words, uv is n-cleared exactly at t + 1; this means that one 
of u,v was already n-clear by t and the other was n-cleared exactly at t + 1. Without loss of 
generality, assume 

u£V^ it)nv^ (t + i), v£ (t) r\vS{t + 1) (45) 

Since a node can be n-cleared only by moving into it, the {t + l)-th move must be w — > w. We 
distinguish the following cases. 

(1.1) Suppose w ^ u. Since v G {t) and u € {t), u was guarded at t; since w ^ u, u 
remains guarded at t -|- 1. Hence, ai t + 1, uv has both ends guarded and so uv E Eg [t + 1). 
(Note that this analysis holds even when w = 0, i.e., when the move was to place a new searcher). 

(1.2) Suppose w = u and u is still guarded at t -|- 1 (i.e., at t there was more than one searcher 
in u). Then at t -|- 1 both u and v are guarded and uv G Eg [t 
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(i.3) The remaining possibility is that w = u and u is unguarded at t + 1 (i.e., at t there 
was exactly one searcher in u). Note that u — > f at t + 1 means that u is guarded at t and 
hence u G (t). If n G {t + 1) as well, then uv G + 1). So we now will show that 

u G (t + 1) is not possible. Indeed u G Vj^ {t + 1) can only happen in one of the following 
two ways. 

(i.3.1) There exists a node xi v such that uxi G E^^ (t) n E^^ {t + 1). But uxi G E^^ (t) = 
E^ (t) implies xi G (t) (b ecause u is guarded at t) and xi G Vj^ {t) implies xi G {t + 1) 
(because xi was not entered at t + 1). Finally, from xi G (t + 1) and u unguarded at t + 1, 



we conclude u G (t + 1) which contradicts assumption (45). 



(i.3. 2) Alternatively, there exists a path vuxi...xk (with K >2), e-unguarded at i + 1 and with 
xk-iXk £ E^ {t)nE^ {t + 1) (note also that, since vuxi...xk is a path, {u, w}n{xi, X2, ...,xk} = 
0). In this case, we have 



and 



and finally 



VUXI...XK e-unguarded at t + 1 
u ^ V ai t + 1 
u ^ {xi,X2, 

UXI...XK e-unguarded at t 

XK-lXK G E^.^ (t) 



UXI...XK is e-unguarded at t; 



XK-ieVB{t) = V^{t); 



D 



XK-i G (t) 
u ^ V at t + 1 

V ^ {xi,X2, ■.■,Xk} 

UXI...XK-1 is n- unguarded at t -|- 1 



XK-i e (t + 1) 



But u G Vj^ (^ + 1) contradicts assumption (45). Hence u ^ V[l {t + 1) 
In short, by examining cases (i.l)-(i.3) we have shown that 

uv G E^ (t) nE^{t + l)^uve Eg (t + l). 
i.e., if at t -|- 1 some uv is n-cleared it is also m-cleared. 



(46) 



II. b If a previously m-clear edge becomes m-dirty at t + \, then it also becomes n-dirty. Suppose 
that uv G (t) n E^.^ [t + 1). Without loss of generality, we can assume that there exists a 
xk~iXk £ E^,j {t) n Ef^ (i + 1) and a path uvxi...xk which was e-guarded at t but became e- 
unguarded at t + l. Since the path became e-unguarded at t + 1, exactly one node in it was 
guarded at t (and became unguarded at t + l). Call this node x-^, with A; = if the node in 
question is v (i.e., xq = v) and k G {1, 2, K — 1] otherwise. The move at 1 is x^r ^ z, where 
z ^ {f , xi, xk-i] (since uvxi...xk is e-unguarded at t + l). Since xk-iXr £ E^ (t) = E^ (t), 
either xk-i G VJ^ (t) or xk £ Vff (t). We consider the following cases. 

(11.1) If XK~i G Vn (.i) tlien x^-i G {t + 1) as well (it was not entered) and the path 
vxi...xk~i is n-unguarded at t -|- 1. So u G Vff {t + 1) and uv G E^ {t + l). 

(11.2) If xk-1 ^ (t), then xk G (t) and so xk-i is the guarded node in the path uvxi...xk 
(i.e., k = K — 1). We distinguish two subcases. 

(ii.2.1) The move at t -|- 1 is xk-i z and z ^ xk- Then xk S {t + 1) and so xk-i S 
Vff {t + 1) and, by the same argument as in (ii.l), we get uv G E^ (t -|- 1). 
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(ii.2.2) The move at t + 1 is xk-i — > xk- Since we have assumed xk-iXk £ E^j {t)r]E^ {t + 1), 
xk-iXk must be adjacent to some other edge xk-W £ {t) n E^j {t + 1); so we can use y in 
place of Xk, uvxi...XK-iy in place of uvxi...xk and conclude by the reasoning of case (ii.2.1) 
that uv £ E^{t + l). 



In short, we have shown that 



uv e Eg (t) n Eg {t + 1) 



uv 



(^Eg{t + 1), 



(47) 



i.e., if at t + 1 some uv is m-dirtied then it is also n-dirtied. 



Hence every edge added at t + 1 to E^ (t) is also added to Eg (t) and every edge removed at t + 1 
from Eg (t) is also removed from E^ (t). These facts, combined with Eg (t) = Eg (t), yield 



E'^{t + l)OEg{t + l) 



III. To strengthen (48) to set equality, we need the reverse set inclusion. In fact we can prove 

for s = 0,l,...,t,t + l,... : Eg{s) Q Eg (s) 



(48) 



(49) 



by following the proof of Theorem 3.3 and replacing "e-clear", "e-dirty", ... with "m-clear", "m-dirty", 
... (analogs of Lemmas 3.1, 3.2 can also be proved easily)]^ Prom (48 1 and (49) we obtain 



Egit + l) = E^{t + l 



D 



(50) 



u G Vgl {t + 1). Then u is unguarded and adjacent to some uv G Eg {t + 1) 



IV. Now suppose 

Eg (t + 1). Hence either u or u is n-dirty and so (since u is unguarded) u G (t + 1). On the other 
hand, if u G Vj^ {t + 1), then u is unguarded and there exists a uv £ Eg {t + 1) 
(since u is unguarded) u G Vg {t + 1). It follows that 



Eg {t + 1) and 



Vgit + l)=Vff{t + l) 



V. In short, we have established that 



Eg {t) = Eg {t) 



Vg it) 



ygit) 



Eg{t + l) 
VP 



Eg{t + l) 

yg{t+i) 



Eq.(51) and the fact Vgj (0) 



y^(0) and Eg {G) 



M (t + 1) 

E^ (0) complete the proof of the theorem. 



(51) 



□ 



Several facts follow from Theorem B.l First, it is well known that mixed edge search is NP-complete 
[9] hence node search is also NP-complete. 



Second, Theorem B.l can be used to obtain a Theorem 3.3 as a corollary with a short proof (which. 



of course, presupposes the lengthy proof of Theorem B.l ). We give a sketch of such a proof. Suppose a 
graph G and a search schedule S are given. Then, by Theorem B.l we have (for every t) Vg (t) = Vg (t) 
and Eg{t) = Eg{t). Now suppose S is applied to G under the rules of the edge game and, up to 
time t we have (t) C Vg (t) and E^ (t) C Eg (t). Then, at time t + 1, every e-cleared edge is also 
m-cleared (since the mixed game has all the clearing rules of the edge game and an additional one) and 
every m-dirtied edge is also e-dirtied (since the edge and mixed games have the same recontamination 
rules) except if an edge was already e-dirty. Hence E^ (t -|- 1) C Eg {t + 1), which also shows that 
V^{t + l)CVg{t + l). 

Finally, the above Theorem B.l and the monotonicity result proved in [5] ("if there is a mixed edge 
clearing search of G using < K guards, there is a monotone mixed edge clearing search of G using 
< K guards") makes the following conjecture seem almost trivially true. 



^Basically the proof remains valid because the recontamination rules remain the same. 



46 



Conjecture B.2. If there is a node clearing search S o/G using at most K guards, there is a monotone 
node clearing search S' of G using at most K guards. 

However some additional work is required to prove the conjecture, because of the following detail: 
edge monotonicity does not necessarily imply node monotonicity. In other words, the implication 

E^'n it) ^E%(t + l)^ it) C (t + 1) (52) 



is not necessarily true! Consider again the search schedule of Remark 2.2 In this case E'^ {1) 



(2) = 0, but (1) C (2) does not hold. It is true however (and easy to prove) that 

(strong edge monotonicity) =^ (node monotonicity). (53) 

Unfortunately, the result proved by Bienstock in [S] concerns "simple", not strong monotonicity. 
Hence one of our future research goals is to extend Bienstock's result to node search and thus prove 
that sn{G) = (G). 
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C Appendix: Vertex separation and pathwidth 

Vertex separation and pathwidth are strongly (but not obviously) related to graph search and search 
number, as discussed in (among other papers) |29l [12], where the author can find a more detailed 
discussion; here we just give the basic definitions and the main theorem. 

Definition C.l. Given a graph G = (y,E), a path decomposition of G is a pair (X,P), where 
X = {Xi, ...,Xm} is a family of subsets of V and P is a path whose nodes are the subsets Xi and they 
satisfy the following properties 

1. U^^Li^i = V. 

2. For every edge vw in E, there is a subset Xm that contains both v and w. 

3. Ifl<i<j<k<M then Xi n Xk C Xj. 

Definition C.2. The width of a path decomposition (^,P) (with X = {Xi, ...,Xn}) is denoted by 
pdw {X, P) and is defined by 

pdw (X,P) = max iXil . 

l<i<n 

Definition C.3. The pathwidth of a graph G = (y,E) is denoted by pw{G) and defined by 

pw (G) = min pdw {X, P) . 

all path decompositions {X,F) of G 

Definition C.4. Given a graph G = (y,E) (with \V\ = N) and a permutation V of {1,2, ...,N}. The 
vertex separation of G with respect to V is denoted by vs{G,V) and defined by 

vs(G,V) = max Ijn G V : V (u) < n and 3v such that: V (v) > n and uv G E\\ . 

l<n<N 

The vertex separation of G is denoted by vs (G) and defined by 

vs{G)= min vs{G,V). 

all permutations V 

The following theorem shows the connection between vs (G) and pw (G) and search number. 
Theorem C.5. For every graph G, vs{G) = pw {G) and \vs{G) — se{G)\ < 1. 
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D Appendix: Implementations of the GSearch Algorithm 



We have implemented the GSST algorithm in two forms which are publicly available. 

1. As a command line executable, which runs on Windows and Linux computers. The program and 
supporting material are available at the URL 

http : / /www . f rc . ri . emu . edu/~gholling/home/ software . html. 

2. As a graphical user interface (GUI), available at 

http : //users . auth . gr/~kehagiat/KehagiasSof tware . htm. 

We discuss each of these implementations separately. 
D.l The Command Line program 

This is an executable, gsearch.exe, which has the following usage 

USAGE: gsearch -m [graph] -n [no. trees] -s [startnode] -g [gen. tree] 
-y [wr-search] -w [wr-tree] -v [visualize] -t [edge traversal] 
-i [improve tree] -1 [low number] -r [redundancy check] 

EXAMPLE: gsearch -m graphs/EdgeOl .txt -g exhaustive -s 1 -n 500000 -t bh 

The various options available are as follows. 

-m [graph] : string, name of file with edge list of the graph. 

-n [no. trees] : int, how many sp. trees to generate (DEFAULT is 1). 

-s [start node] : int, which node to start 

(DEFAULT is 1, random choice is 0) 
-g [gen. tree] : string, method of generating spanning trees 

(acceptable values: readtree, exhaustive, uniform, dfsrand; 
DEFAULT is uniform) 
-y [wr-search] : string, how many best searches to write 

(acceptable values: one, all, none; DEFAULT is one) 
-w [wr-tree] : string, how many best sp. trees to write 
(acceptable values: one, all, none; DEFAULT is one) 
-t [traversal] : string, how to break edge label ties 

(acceptable values: bh, random, bhrand, bhweight, bhdom; 
DEFAULT is bh) . 

-i [improve tree]: boolean, use tree improve technique or not. 

(acceptable values are 0/1; DEFAULT is 0: do not use it) 
-r [redimdancy check] : boolean, check for redundant trees 
(acceptable values are 0/1; DEFAULT 1: 

check for redundancy) 
-1 [low number] : break if a tree is found with this number of 

searchers (DEFAULT is 0: do not break) 
-V [visualize] : boolean, use visualizer (only supported 

on linux) (DEFAULT is 0: do not use) 
-h [help] 



49 



The -m option indicates the file which contains the graph description. This must be an ascii (plain 
text) file containing a list of the edges of the graph, one edge per line, indicated as a pair of nodes. 
The nodes must be continuously numbered from 1 to and these numbers are used as labels. N is 
assumed to be the largest number appearing in the edge list (the graph is assumed to be undirected 
and connected). Examples of edge lists can be found in the graphs directory. 

The remaining options of gsearch correspond to the description of the algorithm in Section [5} The 
-n option corresponds to the M parameter of GSST (number of spanning trees), -s is the root of the 
search, -g describes the uniform and DFS methods for generating spanning trees (there are also options 
for exhaustive enumeration of all spanning trees and for reading a specific spanning tree from file) and 
the -t options (bh , random, bhrand, bhweight, bhdom) corresponds to the traversal methods (L, R, 
LR, LW and LD, respectively). The -i, -r and -1 options are self explanatory. The -y option writes 
one or more minimal strategies in the file(s) output/strat* .txt; each row of the file corresponds to 
one step of the strategy and shows the nodes in which the searchers are currently located. The -w 
option writes the rooted tree(s) corresponding to optimal strategies, in the file(s) output/tree* .txt; 
each row of the file shows one parent and her child. The -v visualization option only works on Linux 
computers (for a visualization of the search on Windows computers use the GUI). 

The command line gsearch was implemented by G. HoUinger in ANSI C using the gcc 4.1 compiler. 



D.2 The Graphical Interface 

This is an executable, gsearchGUI.exe, which corresponds closely to the command line program. 
(In fact the GUI is a front end for the command line program.) Launching the program brings up 
the window of Figure 14 On the left side we see several input boxes. All of these correspond to 



abovementioned options of the command line program with one exception: in addition to the edge list 



file (graphs/Edge 11 . txt in Figure 14) there is also a file containing the x- and y-coordinates of the 
nodes (graphs/Node 11 .txt in Figure 14 - this is required for visualization). The GUI starts with the 



default selection of EdgeOl . txt and NodeOl . txt ; if you want a different graph, type the corresponding 
file names in the input boxes. 



Node Search 



Node File 
Edge File 

Num. of Spanning Trees 
Starting Node 
Spanning Tree Generation 
How Many Searches to Save 
How Many SpTrees to Save 
How to Break Edge Ties 
Improve Tree? 
Check Redundancy 
Low Number Break 



graphs/Edge11.tKt 



graphs/Nodell.tHt 



5000 
T 



Plot Graph 



Search 



Plot Search 



Random Uniform 



No 



"3 
-3 

~3 



Help 



Exit 



Figure 14: A screenshot of the gsearch GUI. 
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Having completed the input boxes on the left side of the window, you can now use the buttons on 
the right side. 



The Search button performs the graph search (it actually invokes the gsearch executable and 
stores results in the output directory). Time to complete the search depends on the size of 
the graph and the number of spanning trees used. Do not use the exhaustive spanning tree 
enumeration option on large graphs because the program may take too long to terminate. 

The Plot Graph button plots the graph. An image of the graph is generated and an image viewer is 
invoked which can be used to view the image file (this is always named graph . j pg) . A screenshot 
of the image viewer appears in Figure IT5| 



The Plot Search button plots the graph search. Actually, first the search strategy files are used 
to generate a sequence of image files, one image corresponding to each step of the search (these 
files are stored in the directory pix) and then the image viewer is invoked to view the files. The 
user can step through the images using the arrow keys, or use the start / stop buttons to run a 



slide show of the search. A screenshot of the image viewer appears in Figure 15 



MyDesktopHelp. Corn's AutoPics [K:\v1\grfsrch61\pix\image0.0004.jpg] 



Q grfsrchGI 



imageO.OOOI. 
imaoeO.0002. 
imageO.0003. 

imageO.OOOG. 
imageO.0007. 
imageO.0008. 
imageO.0009. 
imageO.0010. 
imageO.0011. 



Start 



Stop 




Hide Side Bar 



Figure 15: A screenshot of the image viewer. 



Nota bene: do not run Plot Search before running Search! The Plot Search function uses the latest 
results of Search; if these correspond to an earlier graph (from a previous run of the GUI) the plotted 
search will produce nonsense reults. 

The GUI was implemented by Ath. Kehagias in MS Visual Basic 5.0 (and a little bit of C, using the 
dev-cpp 4.9.9.2 environment). The image viewer is a freeware program called AutoPics and made avail- 
able by the company Mydesktophelp (at http://www.mydesktophelp.com). The GD graphics library 
(version gd-2.0.34-win32) has also been used; this library is available at http://www.libgd.org/. 

Finally it must be stressed that the current version , gsearchGUI v. 0.9, is still beta and requires 
further development. However we make it publicly available in the hope that it will prove useful to the 
graph search community. 
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